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Preface 


Algebra has played a central and decisive role in all branches of mathematics and, 
in turn, in all branches of science and engineering. It is not possible for a lecturer to 
cover, physically in a classroom, the amount of algebra which a graduate student 
(irrespective of the branch of science, engineering, or mathematics in which he 
prefers to specialize) needs to master. In addition, there are a variety of students in a 
class. Some of them grasp the material very fast and do not need much of assis- 
tance. At the same time, there are serious students who can do equally well by 
putting a little more effort. They need some more illustrations and also more 
exercises to develop their skill and confidence in the subject by solving problems on 
their own. Again, it is not possible for a lecturer to do sufficiently many illustrations 
and exercises in the classroom for the purpose. This is one of the considerations 
which prompted me to write a series of three volumes on the subject starting from 
the undergraduate level to the advance postgraduate level. Each volume is suffi- 
ciently rich with illustrations and examples together with numerous exercises. 
These volumes also cater for the need of the talented students with difficult, 
challenging, and motivating exercises which were responsible for the further 
developments in mathematics. Occasionally, the exercises demonstrating the 
applications in different disciplines are also included. The books may also act as a 
guide to teachers giving the courses. The researchers working in the field may also 
find it useful. 

The present (first) volume consists of 11 chapters which starts with language of 
mathematics (logic and set theory) and centers around the introduction to basic 
algebraic structures, viz. group, rings, polynomial rings, and fields, together with 
fundamentals in arithmetic. At the end of this volume, there is an appendix on the 
basics of category theory. This volume serves as a basic text for the first-year course 
in algebra at the undergraduate level. Since this is the first introduction to the 
abstract-algebraic structures, we proceed rather leisurely in this volume as com- 
pared with the other volumes. 

The second volume contains ten chapters which includes the fundamentals of 
linear algebra, structure theory of fields and Galois theory, representation theory of 
finite groups, and the theory of group extensions. It is needless to say that linear 
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algebra is the most applicable branch of mathematics and it is essential for students 
of any discipline to develop expertise in the same. As such, linear algebra is an 
integral part of the syllabus at the undergraduate level. General linear algebra, 
Galois theory, representation theory of groups, and the theory of group extensions 
follow linear algebra which is a part, and indeed, these are parts of syllabus for the 
second- and third-year students of most of the universities. As such, this volume 
may serve as a basic text for second- and third-year courses in algebra. 

The third volume of the book also contains 10 chapters, and it can act as a text 
for graduate and advanced postgraduate students specializing in mathematics. This 
includes commutative algebra, basics in algebraic geometry, homological methods, 
semisimple Lie algebra, and Chevalley groups. The table of contents gives an idea 
of the subject matter covered in the book. 

There is no prerequisite essential for the book except, occasionally, in some 
illustrations and starred exercises, some amount of calculus, geometry, or topology 
may be needed. An attempt to follow the logical ordering has been made throughout 
the book. 

My teacher (Late) Prof. B.L. Sharma, my colleague at the University of 
Allahabad, my friend Dr. H.S. Tripathi, my students Prof. R.P. Shukla, Prof. 
Shivdatt, Dr. Brajesh Kumar Sharma, Mr. Swapnil Srivastava, Dr. Akhilesh Yadav, 
Dr. Vivek Jain, Dr. Vipul Kakkar, and above all the mathematics students of the 
University of Allahabad had always been the motivating force for me to write a 
book. Without their continuous insistence, it would have not come in the present 
form. I wish to express my warmest thanks to all of them. 

Harish-Chandra Research Institute (HRD, Allahabad, has always been a great 
source for me to learn more and more mathematics. I wish to express my deep sense 
of appreciation and thanks to HRI for providing me all the infrastructural facilities 
to write these volumes. 

Last but not least, I wish to express my thanks to my wife Veena Srivastava who 
had always been helpful in this endeavor. 

In spite of all care, some mistakes and misprint might have crept in and escaped 
my attention. I shall be grateful to any such attention. Criticisms and suggestions for 
the improvement of the book will be appreciated and gratefully acknowledged. 


Allahabad, India Ramji Lal 
April 2017 
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Chapter 1 
Language of Mathematics 1 (Logic) 


The principal aim of this small and brief chapter is to provide a logical foundation 
to sound mathematical reasoning, and also to understand adequately the notion of 
a mathematical proof. Indeed, the incidence of paradoxes (Russell’s and Cantor’s 
paradoxes) during the turn of the 19th century led to a strong desire among mathe- 
maticians to have a rigorous foundation to all disciplines in mathematics. In logic, 
the interest is in the form rather than the content of the statements. 


1.1 Statements, Propositional Connectives 


In mathematics, we are concerned about the truth or the falsity of the statements 
involving mathematical objects. Yet, one need not take the trouble to define a state- 
ment. It is a primitive notion which everyone inherits. Following are some examples 
of statements. 


. Man is the most intelligent creature on the earth. 

. Charu is a brave girl, and Garima is an honest girl. 

. Sun rises from the east or sun rises from the west. 

. Shipra will not go to school. 

. If Gaurav works hard, then he will pass. 

. Gunjan can be honest if and only if she is brave. 

. ‘Kishore has a wife’ implies ‘he is married.’ 

. ‘Indira Gandhi died martyr’ implies and implied by ‘she was brave.’ 
. For every river, there is an origin. 

. There exists a man who is immortal. 


STOmANINANFWN KE 


a" 


The sentences “Who is the present President of India?’, “When did you come?’, 
and ‘Bring me a glass of water’ are not statements. A statement asserts something(true 
or false). 
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We have some operations on the class of statements, namely ‘and’, ‘or’, ‘If ..., 
then ...’, ‘if and only if’ (briefly iff), ‘implies’, and ‘not’. In fact, we consider a 
suitable class of statements (called the valid statements) which is closed under the 
above operations. These operations are called the propositional connectives. 

The rules which govern the formation of valid statements are in very much use (like 
those of english grammar) without being conscious of the fact, and it forms the content 
of the propositional calculus. For the formal development of the language, one is 
referred to an excellent book entitled ‘Set Theory and Continuum Hypothesis’ by 
P.J. Cohen. Here, in this text, we shall adopt rather the traditional informal language. 


Conjunction 


The propositional connective ‘and’ is used to conjoin two statements. The conjunc- 
tion of a statement P and a statement Q is written as ‘P and Q’. The symbol ‘/\’ 
is also used for ‘and’. Thus, ‘P /\ Q’ also denotes the conjunction of P and Q. The 
example 2 above is an example of a conjunction. 


Disjunction 


The propositional connective ‘or’ is used to obtain the disjunction of two statements. 
The disjunction of a statement ‘P’ and a statement ‘Q’ is written as ‘P or Q’. The 
symbol ‘\/’ is also used for ‘or’. The disjunction of a statement ‘P’ and a statement 
‘Q’ is also written as ‘P \/ Q’. The example 3 above is an example of a disjunction. 


Negation 


Usually ‘not’ is used at a suitable place in a statement to obtain the negation of the 
statement. The negation of a statement ‘P’ is denoted by ‘-P’. The example 4 above 
is the negation of ‘Shipra will go to school’. The negation of this statement can also 
be expressed by ‘-(Shipra will go to school)’. 


Conditional statement 


A statement of the form ‘Jf P, then Q’ is called a conditional statement. The state- 
ment ‘P’ is called the antecedent or the hypothesis, and ‘Q’ is called the consequent 
or the conclusion. The example 5 above is a conditional statement. ‘If P, then Q’ is 
also expressed by saying that ‘Q is a necessary condition for P’. An other way to 
express it is to say that ‘P is a sufficient condition for Q’. 


Implication 


A statement of the form ‘P implies Q’ (in symbol ‘P = > Q’)is called an implication. 
The statement ‘P = > Q’ and the statement ‘If P, then Q’ are logically same, for 
(as we shall see) the truth values of both the statements are always same. Again, “P’ 
is called the antecedent or the hypothesis, and ‘Q’ is called the consequent or the 
conclusion. Example 7 is an implication. 


Equivalence 

A statement of the form ‘P if and only if Q’ (briefly ‘P iff Q’) is called an equivalence. 
‘P implies and implied by Q’ (in symbol ‘P <=> Q’) is logically same as ‘P if and 
only if Q’. We also express it by saying that ‘P is a necessary and sufficient condition 
for Q.’ Examples 6 and 7 are equivalences. 
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1.2 Statement Formula and Truth Functional Rules 


A statement variable is a variable which can take any value from the class of valid 
atomic statements (statements without propositional connectives). We use the nota- 
tions P, Q, R, etc. for the statement variables. A well-formed statement formula 
is a finite string of the statement variables, the propositional connectives, and the 
parenthesis limiting the scopes of connectives. Thus, for example, 


(P => Q) => (-P\/ 0), 


(P => 0) \/(Q = P) 


PVOARSePVOACV® 


are well-formed statement formulas. 

The rules of dependence of the truth value of a statement formula on the truth 
values of its statement variables (atomic parts) (which are prompted by our common 
sense) are called the truth functional rules. These rules are illustrated by tables called 
the truth tables. 


and 


The truth functional rule for the conjunction ‘P /\ Q’ 


The statement formula ‘P /\ Q’ is true only in case both P as well as Q are true. 
Thus, the truth functional rule for ‘P /\ Q’ is given by the table 


The truth functional rule for the disjunction ‘P \/ Q’ 


The statement formula ‘P \/ Q’ is true if at least one of P and Q is true. The table 
giving the truth functional rule for ‘P \/ Q’ is 


PIO|P VO 
T\T T 
T\F T 
F\T T 
F\F| F 


The truth functional rule for the negation ‘(—P)’ 
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The negation of a true statement is false and that of a false statement is true. Thus, 


P|—P 
T| F 
Fi T 


The truth functional rule for ‘Jf P, then Q’ ((P => Q’). 


The statement formula ‘Jf P, then Q’ (‘P => Q?’) is false in only one case when 
P is true but Q is false. Take, for example, the statement ‘If a student works hard, 
then he will pass.’ The truth of this statement says that if some student works hard, 
then he will pass. If there is some student who has not worked hard, then whether he 
passes, or he fails, the truth of the statement remains unchallenged. Thus, the truth 
table for ‘If P, then Q’ is as follows: 


P\Q\UIf P, then Q|P=>@Q 
T|T T T 
T|F F F 
F\T T T 
F\F T T 


The statement ‘P< Q’ is the conjunction of the statement “P= >Q’ and the 
statement ‘O— > P’. Thus, the truth table for the equivalence ‘P<—>(Q’ is as follows: 


P\O|P => O|P if O 
TIT| -£ T 
TIFl OF F 
F\T| F F 
F|F| T il 


Example 1.2.1 Truth table for the statement formula ‘(P \/ Q) => (Q \/ P)’ 


PIQiPV gia V PIP V Q) => (VP) 
ae T 
fie) © |e T 
ace i a es T 
FIF| F | F T 


Example 1.2.2 Truth table for the statement formula ‘(P \/ Q) => (P A Q)’ 


PVQO/PAQ(PV Q) = PAQ) 


APRS 


Q 
T 
F 
T 
F 


5) | i | 
qT AS 
Say ay 
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Example 1.2.3 Truth table for the statement formula ‘[(P \/ Q) A —P] => Q’ 


PIQiPV Q|-P(P VQ) A-PILP V Q) A -Pl = @ 
T|T| T |F F T 
T|F| T |F F T 
FIT| T [T c T 
F|F| F [T F T 


—P)’ 


PIQiPV Q|-(P V Q)|—P|-Q|—P \ —Q|—(P V Q) => (—P A -9) 
T|T| T F |F|F| F iy 
T|F| T F |F|T F ij 
FIT| T F |T|F| F i 
FI|F| F me ae: T T 


Example 1.2.6 Truth table for the statement formula ‘P \J —P’ 


P|—P|PV —P 
FFF 
FIT| T 


Example 1.2.7 Truth table for the statement formula ‘—(P=>Q)=>(P /\ —Q)’ 
Q|P => Q|-(P => Q)|-9|/P A -9|-P = 2) => PA-Q) 


>] ee] ee nee 
>] ae >| ae! 


APRP aS 
ya] a] 
pel eel lioe| | 
eo] >| Mee] >| 
APRPA] SR 
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Exercises 
Construct the truth tables for the following statement Formulas. 


1.2.1 PA —P. 

1.2.2 (PA 0) = QAP). 

1.2.3 (P \ Q) => (PV Q). 

1.2.4 P <=> —-(-P). 

1.2.5 —(P \ Q) => (-P\V —-Q). 
1.2.6 (P => Q) => (-PV Q). 
12.7 O= PV —-OQ): 

1.2.8 (PA —-P) =. 

1.2.9 (PA Q) = @. 

1.2.10 P => (P\ Q). 

1211 P>O> a. 

1.2.12 [(-P—> OS [PA-2VA-PAQ)I. 
12.13 (P|) => —P. 

1.2.14 P => ((-P) AQ). 

12.15 (SO VoO=}P). 


1.2.16 (P= > 0) = (OQ => P). 

1.2.17 [P\V(Q A —Q)| => P. 

1.2.18 (P <=> Q) \/(-P). 

1.2.19 P\/(-P AQ). 

1.2.20 (PV O)\ VR PV(QV R). 

1.2.21 (PAQ) AR PAQA|R). 

1.2.22 PV(O A R) => (PV OQ) A(PV R). 
1.2.23 PA(QVR) => (PAQVPAR). 
1.2.24 [(P — > Q) \(Q — > R)] => [P — RI. 


1.2.25 [(P => Q) V(O => R)] => [P = R]. 
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1.3 Quantifiers 


Universal Quantifier 


Consider the statement ‘For every river, there is an origin.’ This can be rewritten 
as ‘For every x, ‘x is a river’ implies ‘x has an origin’.’ More generally, we have a 
statement of the form ‘For every x, P(x).’, where ‘P(x)’ is a valid statement involving 
x. The symbol ‘V’ is used for “for every,’ and it is called the universal quantifier. 
The example 9 of Sect. 1.1 may be represented by “Vx(‘x is a river "==> ‘x has an 
origin’)’. 

Existential Quantifier 


Consider the statement “There is a man who is immortal.’. More generally, we have 
statements of the form “There exists x, P(x).’, where ‘P(x)’ is a statement involving x. 
The symbol ‘J’ stands for ‘there exists,’ and it is called the existential quantifier. The 
example 10 of Sect. 1.1 may be represented as ‘4x, “x is a man’ and ‘x is immortal’.’. 

Parenthesis ‘( )’ and brackets ‘[ ]’ will be used to limit the scope of propositional 
connectives and quantifiers to make valid mathematical statements. 


Negation of a Statement Formula Involving Quantifiers 


Consider the statement ‘Every river has an origin.’. This can be rephrased as 
“Wx(‘x is a river’ = > ‘x has an origin’).’. When can this statement be false? It 
is false if and only if there is a river which has no origin. Similarly, consider the 
statement ‘Every man is mortal.’ . This can also be rephrased as ‘Vx(‘x is a man’ => 
‘x is mortal’).’. Again this statement can be challenged if and only if there is a man 
who is immortal. Now, consider the statement ‘There is a river which has no origin. ’. 
To say that this statement is false is to say that “Every river has an origin.’ This 
prompts us to have the truth functional rule for the statement formulas involving 
quantifiers as given by the following table. 


Vx(P(x) => O(x))|Ax(P@) A —O@)) 


Thus, ‘—[Vx(P(@~) = > Q(x))] — [8x(P@) A —@@))], where P(x) and 
Q(x) are valid statements involving the symbol x, is always a true statement. Also 
‘—[A(P(x) => Q(x))] = > [Vx((Px) A —Q(@))) is always a true statement. 


8 1 Language of Mathematics 1 (Logic) 


1.4 Tautology and Logical Equivalences 


A statement formula is called a tautology if its truth value is always T irrespective 
of the truth values of its atomic statement variables. A statement formula is called 
a contradiction if its truth value is always F irrespective of the truth values of its 
atomic statement variables. Thus, the negation of a tautology is a contradiction, and 
the negation of a contradiction is a tautology. 

All the examples in Sect. 1.2 except the Example 1.2.2 are tautologies. ‘P /\ —P’ 
is a contradiction. 


Example 1.4.1 ‘—Wx(P(x) => Q(x)) = > Ax(P(®) A —Q(a))’ is a tautology. 


Example 1.4.2 ‘((P = > Q) \(Q => R)] > (P = > R)’ is a tautology (verify 
by making truth table). 


Thus, if the statement formulas ‘A => B’ and ‘B => C’ are tautologies, then 
‘A => C’ is also a tautology. 

For the given statement formulas ‘A’ and ‘B’, we say that A logically implies B or 
B logically follows from A if ‘A => B’ is a tautology. (Here, A and B are not simple 
statement variables). In fact, if “P’ and ‘Q’ are statement variables, then “P —> Q”’ 
is a tautology if and only if P is same as Q. Further, the statement formula A is said 
to be logically equivalent to B if ‘A <=> B’ is a tautology. 

In mathematics and logic, we do not distinguish logically equivalent statements. 
They are taken to be same. If A is logically equivalent to B, we may substitute B for 
A and A for B in any course of discussion or derivation. 


Example 1.4.3 ‘P =» Q’ is logically equivalent to ‘—P \/ Q’. 


Example 1.4.4 ‘—[Vx(P(x) = Q(x))]’ is logically equivalent to ‘Ax(P(x) /\ — 
Q(x))’ and ‘Wx(P(x) A —Q(x))’ is logically equivalent to ‘—[Ax(P(Qx) => 
O(x)))’. 


Example 1.4.5 The notation lim, >. X, = x stands for the statement 

‘Ve[eis a positive real number => 

AN(N is a natural number => Vn(n is greater than N 

= >| x, — x | is less than €))]’. 

If we apply the logical equivalence in Example 1.4.4 repeatedly, then we find that 
— (limps 00 Xn = X) 1S Same as 

de (€ is a positive real number /\ VN 

(N is a natural number /\ 

dn(n is greater than N /\ | X, — x | is not less than €))). 
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Exercises 


1.4.1 Find out which of the statement formulas in exercises from 1.2.1 to 1.2.25 are 
tautologies and which of them are contradictions. 


1.4.2 Is ‘Sun rises from the east’ a tautology? 


1.4.3 Obtain a logically equivalent statement formula for the negation of ‘Vx[P(x) 


=> (RO) AT). 


1.4.4 Show that the set ‘{\/,—}’ of propositional connectives is Functionally 
Complete in the sense that any statement formula is logically equivalent to a statement 
formula involving only two connectives \/ and —. Is the representation thus obtained 
unique? Support. Similarly, show that the set ‘{ /\, —}’ is also Functionally Complete. 
Thus, the set ‘{\/, —}’ of propositional connectives is sufficient to develop the math- 
ematical logic. 


1.4.5 Let A be a statement formula which is a tautology. Suppose that ‘A => B’ 
is also a tautology. Show that B is also a tautology. Can B be a statement variable? 
Support. 


1.4.6 Let A be a statement formula which is a tautology. Show that ‘A \/ B’ and 
‘B => A’ are also tautologies. 


1.4.7 Suppose that A and B are tautologies. Show that ‘A /\ B’ is also a tautology. 


1.4.8 Suppose that ‘A => B’ and ‘B => C’ are tautologies. Show that ‘A => C’ 
is also a tautology. 


1.5 Theory of Logical Inference 


In any course of mathematical derivations and inferences, we have certain statements 
termed as axioms, premises, postulates, or hypotheses whose truth values are assumed 
to be T, and then infer the truth of a statement as a theorem, proposition, corollary, 
or a lemma. Indeed, a statement is a theorem (proposition, lemma, or a corollary) if 
and only if the conjunction of premises tautologically imply the statement. 

The theory of logical inference is like playing games. Take, for example, a game 
of chess. The initial position of the chess board corresponds to premises. There 
are finitely many rules of the game, and the players have to follow these rules while 
making their moves. These rules of the game correspond to tautological implications. 
The player | initially moves one of his chess pieces as per the rules of the game. The 
new position of the chess board becomes premises for the player 2. The player 2, in 
his turn, moves one of his chess pieces as per rules of the game, of course, keeping 
his eyes on a winning position. Next, the player 1, in his turn, takes this new position 
as the premises and moves one of his chess pieces as per rules and so on. The player 
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who reaches the winning position (desired theorem) wins the game. However, the 
game may end in a draw, and the players may reach at a position of the chess board 
from where no player can ever reach the winning position by moving the chess pieces 
as per rules. Each position of the chess board where the players reach corresponds 
to a theorem. 

Thus, our main aim is to describe irredundant finite set of rules of inferences 
which meets the following two criteria. 


1. For any set of premises, the rules of logical inferences must allow only those 
statements which follow tautologically from the conjunction of premises. 

2. All statements which follow tautologically from the conjunction of premises can 
be derived in finitely many steps by applying the rules of inferences. 


Indeed, the following three rules of inferences are adequate to derive all theorems 
under given premises. 

Rule |. A premise may be introduced at any point in a derivation. 

Rule 2. A statement ‘P’ may be introduced at any point of derivation if the con- 
junction of the preceding derivations tautologically implies ‘P’. 

Rule 3. A statement ‘If ‘P,’ then ‘Q’ ’ may be introduced at any point of derivation 
provided that ‘Q’ is derivable from the conjunction of ‘P’ and some of the premises. 

We illustrate these rules of inferences by means of some simple examples. 


Example 1.5.1 Vf ‘Shreyansh is a prodigy, then ‘if ‘he will become a scientist,’ then 
‘he will win a Nobel prize’ ’. ‘Shreyansh will become a scientist’ or “he will become a 
cricketer.’ If “Shreyansh is nota prodigy,’ then “he will become acricketer.’ Shreyansh 
will not become a cricketer. Therefore, ‘Shreyansh will win a Nobel prize.’ 

Here, the statement “‘Shreyansh will win a Nobel prize’ is to be derived as a 
theorem. The statements preceding to this statement are premises. We use the rules 
of inferences to deduce this theorem. We symbolize the statements as follows: Let 
‘P’ stands for the statement ‘Shreyansh is prodigy, “S’ for the statement “He will 
become a scientist,’ ‘N’ for the statement ‘Shreyansh will win a Nobel prize’ and *C” 
for the statement ‘Shreyansh will become a cricketer’. Thus, “P => (S => N), 
‘S\V C” ‘—P => C’ and ‘—C’ are premises and we have to derive N as a theorem. 
Now, 


Dee, SO eae cin cathe teed eageete ieee Premise (Rule 1). 

2 SP i ccticsctitennedes Premise (Rule 1). 

De EP seeasssits shisavsin Stead teases iiss: ‘(-P => C) A —C’ tautologically implies ‘P’ 
(Rule 2). 

4. ‘P=> (S => N).’....... Premise (Rule 1). 

DS SSN aster alii. ‘P \(S = > NY’ tautologically implies ‘S => N’ 
(Rule 2). 

65. SV CO derctavsereiserags ss tiasincies Premise (Rule 1). 

Te OS iti an naanieiaind ‘—C A(S \V C)’ tautologically implies ‘S’ (Rule 
2). 

Ben GIN sSiae' as suedh WSeessasateuscstuguss dissevsils °S A(S = > NY tautologically implies ‘N’ (Rule 


1.5 Theory of Logical Inference 11 


This establishes ‘N’ as a theorem. Note that in this derivation we have not used the 
rule 3. 


Example 1.5.2 ‘Tf Shreyal is not a genius,’ then ‘he cannot solve difficult mathemat- 
ical problems.’ ‘If he cannot solve difficult mathematical problems,’ then ‘he will not 
become a great mathematician.’ ‘Shreyal will become a business tycoon’ or ‘he will 
become a great mathematician.’ He will not become a business tycoon. Therefore, 
‘Shreyal is genius.’ 

Here, the statement ‘Shreyal is genius’ is to be derived as a theorem. The state- 
ments preceding to this statement are premises. We use the rules of inferences to 
deduce this theorem. We symbolize the statements as follows: Let ‘G’ stands for the 
statement ‘Shreyal is genius,’ ‘P’ for the statement ‘He can solve difficult mathemat- 
ical problems, ‘M’ for the statement ‘He will become a great mathematician,’ and 
‘B’ for the statement “Shreyal will become a business tycoon’. Thus, “—G => —P’, 
‘_P => —M’, ‘B\/ M’, and ‘—B’ are premises, and we have to derive G as a 
theorem. Now, 


We SBE eikaiiekee doe Premise (Rule 1) 

2. ‘B\ Moe Premise (Rule 1) 

Be OM ccceatetuetivenanehes ’-B \(B\V M)’ tautologically imply ‘M’ (Rule 2) 

4, ‘—P => —M’......... Premise (Rule 1) 

Ds PP piss cetpeeienevasuet danse >(—P => —M) ( M’ tautologically imply ‘P’ (Rule 2) 
6. ‘—-G => —-P’......... Premise (Rule 1) 

Te, GO geese ciwirtie ee *(—G => —P) (J P’ tautologically imply ‘G’ (Rule 2) 


This establishes G as a theorem. Note that in this derivation also we have not used 
the rule 3. 


The next example uses rule 3 also. 


Example 1.5.3 Yf*Sachiis honest,’ then ‘if “she is brave, then ‘she will be intelligent”. 
‘Sachi is not hard working or she is honest’. ‘Sachi is brave’. Therefore, if “Sachi is 
hard working,’ then ‘she will be intelligent.’ 

Here, the statement ‘If ‘Sachi is hard working,’ then ‘she will be intelligent’ ’ is 
to be derived as a theorem. The statements preceding to this statement are premises. 
We use the rules of inferences to deduce this theorem. Symbolize the statements as 
follows: Let ‘H’ stands for the statement ‘Sachi is honest,’ ‘B’ for the statement ‘She 
is brave, ‘J’ for the statement “She will be a intelligent, and ‘W’ for the statement 
‘Sachi is hard working’. Thus, ‘H => (B => 1), ‘—W \/ H’ and ‘B’ are premises, 
and we have to derive ‘W => I’ as a theorem. Now, 


1. ‘H => (B= > J)’.....Premise (Rule 1). 

De SWAN AP ccc eed neseetstsaties Premise (Rule 1). 

Be UB scsedi lateseskts Seadstehecdadicacienss Premise (Rule 1). 

Bie OW. a AT. ozs ts ssnsssiasescsnivers The conjunction ‘W’ and the premise ‘—W \/ H’ 


tautologically imply ‘H’ (Rule 3). 
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eae | e—— 2 eee ore The conjunction of ‘W, ‘W => H, ‘B’ and ‘H => 
(B => I) tautologically imply ‘J’ (Rule 2 and Rule 3). 


Consistency of Premises 


A set of premises is said to be a consistent set of premises if the conjunction of 
premises has its truth value T for some choice of truth values of each premise. 
It is said to be inconsistent, otherwise. Thus, to derive that a set of premises is 
inconsistent is to derive that the conjunction of the set of premises tautologically 
imply the statement ‘P /\ —P’. However, in many situations, it is not so easy to 
establish the consistency of premises. Easiest way, perhaps, is to have an example 
where all the premises happen to be true. 

Often, a lawyer in a court while cross-examining a witness of the other side tries 
to establish that the evidences and statements of witness as premises is inconsistent 
by producing a paradox out of witness and there by discrediting the witness. 


Example 1.5.4 Vf in the set of premises of Example 1.5.3, we adjoin the statement 
‘Sachi is hard working and she is not intelligent, then the set of premises becomes 
inconsistent. For, then ‘(W ==> I) \(W (A —JD)’ is logically derivable from the 
set of premises. Observe that ‘(W => I) \(W A —J)’ is logically equivalent to 
‘PA —-P’. 


Exercises 


1.5.1 If ‘the prices of the essential commodities are low, then ‘the government 
will become popular.’ “The prices of the essential commodities are low’ or ‘there 
is a shortage of the essential commodities.’ If ‘there is a shortage of the essential 
commodities,’ then ‘the production of essential commodities is low.’ However, ‘there 
is a huge production of essential commodities.’ Using the logical rules of inference, 
derive the proposition ‘The government will become popular.’ 


1.5.2 ‘Sachi is creative’ or ‘she is intelligent.’ If ‘Sachi is creative,’ then ‘she is 
imaginative.’ “Sachi is not imaginative’ or “she is not a musician.’ In fact, “Sachi is 
a musician.’ Derive the statement ‘Sachi is intelligent’ as a theorem. 


1.5.3. Test for consistency the following set of premises. 
If ‘Shikhar is good in physics, then ‘he is good in mathematics.’ If ‘he is good 


in mathematics, then ‘he is good in logic.’ “He is good in logic’ or ‘he is good in 
physics.’ He is not good in logic. 


Chapter 2 
Language of Mathematics 2 (Set Theory) 


This chapter contains a brief introduction to set theory which is essential for doing 
mathematics. There are two main axiomatic systems to introduce sets, viz. Zermelo— 
Fraenkel axiomatic system and the Gédel—Bernays axiomatic system. Here, in this 
text, we shall give an account of Zermelo—Fraenkel axiomatic set theory together 
with the axiom of choice (an axiom which is independent of the Zermelo—Fraenkel 
axiomatic system). We also discuss some of the important and useful equivalents 
of the axiom of choice. The ordinal and the cardinal numbers are introduced and 
discussed in a rigorous way. For the further formal development of the theory, the 
reader is referred to the “Set Theory and Continuum hypothesis’ by P.J. Cohen or the 
‘Axiomatic set theory’ by P. Suppes. 


2.1 Set, Zermelo—Fraenkel Axiomatic System 


‘Set’, ‘belongs to, and ‘equal to’ are primitive terms of which the reader has intuitive 
understanding. Their use is governed by some postulates in axiomatic set theory. 

To take the help of intuition in ascertaining the use of the primitive terms, we 
regard a set as a collection of objects. ‘A class of students,’ ‘a flock of sheep,’ ‘a 
bunch of flowers,’ and ‘a packet of biscuits’ are all examples of sets of things. The 
notation “a € A’ stands for the statement “a belongs to A’ (‘a is an element of A,’ or 
also for ‘a is a member of A’). The negation of ‘a € A’ is denoted by ‘a ¢ A.’ The 
notation ‘A = B’ stands for the statement “A is equal to B.’ The negation of ‘A = B’ 
is denoted by ‘A # B.” The following axiom relates ‘e’ and ‘=.’ 


Axiom 1 (Axiom of extension) Let A and B be sets. Then, 


‘A = B if and only if ‘for all x (x € A if and only if x € B).’ 


© Springer Nature Singapore Pte Ltd. 2017 13 
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Thus, two sets A and B are equal if they have same members. Two equal sets are 
treated as same. If A = B, then we may substitute A for B and B for A in any course 
of discussion. 


Remark 2.1.1 Tobe logically sound in the use of primitive terms, axiom of extension 
is a necessity. 


Let A and B be sets. We say that A is a subset of B (A is contained in B or 
B contains A) if every member of A is amember of B. The statement ‘A is a subset of B’ 
is the same as the statement ‘For all x(if x € A, then x € B).’ The notation ‘A C B’ 
(or also ‘B D A’) stands for the statement ‘A is a subset of B.’ Thus, ‘A = B’ (axiom 
of extension) if and only if ‘A C B and B C A.’ The negation of ‘A C B’ is denoted 
by ‘A ¢& B.’ Since the negation of the statement ‘For all x(if x € A, then x € B)’ 
is logically same as the statement ‘There exists x(x € A and x ¢ B), the notation 
‘A ¢ B’ stands for the statement ‘There exists x(x € A and x ¢ B).’ Thus, to say 
that A is not a subset of B is to say that there is an element of A which is not in B. 

Every set is a subset of itself, because ‘For all x(if x € A, then x € A)’ is 
a tautology (always a true statement). If A C Band A # B, then we say that 
A is a proper subset of B. The notation ‘A C B?’ stands for the statement ‘A is 
a proper subset of B.’ Thus, A is a proper subset of B if every member of A is 
a member of B, and there is a member of B which is not a member of A. More 
precisely, ‘A C B’ represents the statement ‘(For all x(if x € A, then x € B)) and 
(there exists x(x € Band x ¢ A)).’ 


Proposition 2.1.2 [fA C Band BCC, thenA CC. 


Proof Suppose that A C Band B C C. Let x € A. Since A C B, x € B. Further, 
since B C C,x € C. Thus, ‘for all x(if x € A, thenx € C).’ This shows that A C C. 
t 


Some of the axioms of set theory are designed to produce different sets out of 
given sets. The first one is to generate subsets of a set. 

Consider the set A of all men and the statement ‘x is a teacher.’ Some members of 
A are teachers, and some of them are not. The condition that ‘x is a teacher’ defines 
a subset of A, namely the set of all male teachers. To make it more formal, we have: 


Axiom 2 (Axiom of specification) Let A be a set, and P(x) be a valid statement 
involving the free symbol x. Then, there is a set B such that 


‘for all x(x € B if and only if (x € A and P(x)).’ 


Thus, to every set A, and to every statement P(x), there is a unique set B whose 
members are exactly those members of A for which P(x) is true. 

The set B described above is denoted by {x € A | P(x)}. Clearly, B is a subset of 
A. 


Proposition 2.1.3. Let A be a set. Then there is a set B such that B ¢ A. 
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Proof Consider the statement ‘x is a set and x ¢ x.’ By the axiom of specification, 
there is a unique set B = {x € A such that x is a set and x ¢ x}. We show that 
B ¢ A. Suppose that B € A. If B € B, then B ¢ B. Next, if B ¢ B, then since BE A 
(supposition), and B is a set, B € B. Thus, ‘B ¢ B if and only if B € B. This isa 
contradiction (P if and only if —P is a contradiction) to the supposition that B € A. 
Hence, B ¢ A. tt 


Corollary 2.1.4 There is no set containing all sets.! tt 
Let A and B be sets. Consider the statement ‘x € B.’ The set {x € A | x € B}is 
denoted by ‘A (| B, and it is called the intersection of A and B. Thus, 


x €A( |B if and only if (x € A and x € B). 


Since ‘[x € A and x € B] if and only if [x € Band x € A]’ is a tautology, we 
have the following proposition. 


Proposition 2.1.5 A()B= B/)A. t 
Proposition 2.1.6 A(]BCAandA()\BCB. 


Proof By the definition, x € A) B if and only if [x € A and x € B]. Further, ‘if 
[x € Aand x € B], thenx € A’ isa tautology. Thus, if x € A(| B, then x € A. This 
shows that A () B C A. Similarly, A) B C B. tt 


Proposition 2.1.7 If [C C A and C C B], then[C CA) B]. 


Proof Suppose that C C A andC C B. Letx € C. Since C CAandC CB,xeEA 
and x € B. Thus, x € A()B. Hence, if x € C, then x € A()B. This shows that 
CEA(|Fs tt 


Proposition 2.1.8 [A (B= A] if and only if [A C B]. 


Proof Suppose that A()B = A. Since A(|B C B (Proposition 2.1.6), A C B. 
Suppose that A C B. Since A C A, A C Af) B (Proposition 2.1.7). Also, A (] BCA 
(Proposition 2.1.6). By the axiom of extension, A (| B = A. tt 


Proposition 2.1.9 (A(]B)()C =Af\(B(] C). 


Proof Let x € (A({)B)(\C. By the definition, (x € A and x € B) andx € C. 
This implies (tautologically) that x € A and  € Band x € C). It follows that 
x €A(\(B()C). Thus, (AQ) B) NC € ABN C). Similarly, AABN C) < 
(A (| B) () C. By the axiom of extension, the result follows. tt 


‘In pre-axiomatic intuitive development of set theory, people took for granted that there is a set 
containing all sets. The argument used in the proof of the Proposition 2.1.3 led to a paradox known 
as ‘Russel’s paradox.’ In fact, the need for axiomatization of set theory was consequence of such 
paradoxes. 
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Let A and B be sets. Consider the statement x ¢ B. By the axiom of specification, 
there is a unique set defined by {x € A | x ¢ B}. This set is denoted by A — B, and it 
is called the complement of B in A (or A difference B). Clearly, A — B is a subset of 
A. 


Proposition 2.1.10 A—B=A-— (A()B). 


Proof Letx ¢ A—B. By the definition, x ¢ Aandx ¢ B. This implies (tautologically) 
that x € A and(x € A and x ¢ B). Thus, x € A — (A{)B). This shows that 
A—BCA-—(Af)B). Similarly, A — (A) B) C A —B. By the axiom of extension, 
the result follows. tt 


To have something in our hand, we formally assume the existence of a set as an 
axiom. 


Axiom 3 (Axiom of existence) There exists a set. 


Let A be a set. Consider A — A. If B is any set, then 
(x € Aand x ¢ A) if and only if (x € Band x ¢ B) 


is a tautology (note that ‘(P and —P) if and only if (Q and —Q)’ is a tautology). Thus, 
x € (A —A) if and only if x € (B — B), andso A — A = B — B. Therefore, the set 
A — A is independent of A. This set is called the empty set, or the void set, or the 
null set, and it is denoted by J. Thus, 6 = {x € A | x ¢ A}. Clearly, ‘x € # isa 
contradiction. Further, the statement ‘if x € %, then Q’ is a tautology whatever the 
statement Q may be. 

Let P(x) be any contradiction involving the symbol x. Clearly, then = {x € A | 
P(x)}. Intuitively, one may think of J as a set containing no elements. 


Proposition 2.1.11 The empty set O is a subset of every set. 


Proof Let B be a set. We have to show that ‘if x € 0, thenx € BY Sincex € Disa 
contradiction, ‘if x € J, then x € B’ is a tautology. Hence, 4 C B. tt 


Proposition 2.1.12 A — B = Y if and only if A CB. 


Proof Suppose that A — B = J. Letx € A. SinceA—B=%,x ¢ A—B (forx ¢ Dis 
a tautology). Further, since x € A and x ¢ A — B, x € B. Hence, A C B. Conversely, 
suppose that A C B. We have to show that A — B = J. Already (Proposition 2.1.11), 
we have @ CA — B. Letx € A— B. Then, x € A and x ¢ B. Since A C B, it follows 
that x € B and x ¢ B. This, in turn, implies that x € @. Hence, A — B C &. tt 


Axiom 4 (Axiom of replacement) Let A be a set, and P(x, y) be a statement formula 
involving x and y such that Vx € A((P(x, y) and P(x, z)) = > y = z). Then, there is 
asetB = {y| P(x, y) holds for some x € A}. 
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The axiom tells that if A is a set, and there is a correspondence from the members 
of A to another collection of objects associating each member of A a unique member 
of the collection, then the image is set. This axiom will be used in our discussions 
on ordinals. 

The following axiom helps us to generate more sets. 


Axiom 5 (Pairing axiom) Let A and B be sets. Then, there is a set C such that A € C 
andBe C. 


Consider the statement “x = Aorx = B. By the axiom of specification, we 
have a unique set {x € C | x = A or x = B}. This set is also independent of the set 
C. It contains A and B as elements and nothing else. We denote this set by {A, B}. 
The set {A, A} is denoted by {A}, and it is called a singleton. 

We have the empty set J. Consider {0}. Since @ € {A} and@ ¢ 0,0 A {G}. If {0} = 
{{D}}, then @ = {0}. This is a contradiction. Hence, {0} 4 {{G}}. Similarly, {{{A}}} F 
{{H}}. Axiom of pairing gives us other new sets such as {@, {0}}, {{0, {G}} and, {{A}}}. 
This way we produce several sets. 


Axiom 6 (Union Axiom) Let A be a set of sets. Then, there is a set U such that 
“(X € A and x € X) implies that x € U.’ 


By the axiom of specification, we have the unique set given by 
{x € U| x €X for some X € A}. 


This set is denoted by Uy_,4 X, and it is called the union of the family A of sets. 
Thus, 
xe€ |) X if and only if x € X for some X EA. 
XeA 


What is Uy.4 X? Ifx € Uyey X , then there exists X € % such that x € X. But X € J 
is a contradiction. Hence, Uy. X = Y. Clearly, Uxeray X=A. 
The set Ux<,4,n) X is denoted by A L B. Thus, 


x €A| J Bif and only if x € Aorx eB. 


The set A ) B is called the union of A and B. 
Proposition 2.1.13 A CAB. 


Proof Suppose that x € A. Then, the statement ‘x € A or x € B’ is true (if P, 
then (P or Q) is a tautology). Hence, if x € A, thenx € AU B. Thus,A CAUJB.t 


Proposition 2.1.14 AJ% =A. 


Proof Since x € @ is always false, x € A if and only if (x € A or x € 0). Hence, 
AUG=A. t 
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Proposition 2.1.15 A) B= BUA. 


Proof Clearly, ‘(x € A or x € B) if and only if (x € Bor x € A) is a tautology. 
Hence, AU) B= BUA. tt 


Proposition 2.1.16 AUJA=A. 


Proof Since the statement ‘(x € A or x € A) if and only if x € A’ is a tautology, the 
result follows. t 


Proposition 2.1.17 A |) B =A if and only if BCA. 


Proof Suppose that A (J B = A. By the Proposition 2.1.13, B C AUB = A. Next, 
suppose that B C A. Then,A CAL) B CAUA = A. Hence, AU B=A. tt 


Proposition 2.1.18 (AU) B) JC =AUBUC). 


Proof Let x € (AU B) UC. By the definition, ‘(x € A or x € B) or x € C.” This 
implies (tautologically) that “x € A or (« € Borx e€ C).’ It follows that ‘x € 
AUBUC). Thus, (AUB)UC © AUBUC). Similarly, ‘AY(BUC) 
(A LJ B)() C.’ By the axiom of extension, the result follows. tt 


Proposition 2.1.19 The union distributes over intersection, and the intersection dis- 
tributes over union in the following sense: 

LAUBlNC)=AUB MAU OC), and 

2. (A((BUC) = ANB) UN ©). 


Proof 1. Letx € AUJ(B{) C). By the definition, ‘x € A or (x € Band x € C).’ 
This implies (tautologically) that ‘(x € A or x € B) and (x € Aorx € C). In turn, 
‘x € (AUB) (\(AU ©).’ This shows that ‘A (BIC) € (AUB) MAU C).’ 
Similarly, ‘(A J B) (AU C) CAU(Bf) C).’ By the axiom of extension, ‘A J (B 
NO =AUBNAUC? 


Similarly, we can prove 2. tt 


Theorem 2.1.20 (De Morgan’s Law) Let A, B, and C be sets. Then, 
1A-—(BUC)=(A-B)(\(A-C). 
2.A—(B(\C)=(A-B)UA-OC). 


Proof 1. First observe that the statement ‘x ¢ (B\JC)’ is logically equivalent to 
the statement ‘x ¢ B and x ¢ C.’ Let x € A— (BUC). Then, by the definition, 
‘x € Aand x ¢ (B\LJC).’ This implies that ‘x € A and (x ¢ Band x ¢ C).’ 
In turn, it follows that ‘~~ € A and x ¢ B) and (x € A and x ¢ C).’ Thus, 
‘x € (A — B)()\(A — C).’ This shows that ‘A — (BUC) € (A- B)(\A-C).’ 
Similarly, ‘(A — B) (\(A — C) CA— (BU C).’ The result follows by the axiom of 
extension. The proof of 2 is similar. tt 
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Axiom 7 (Power Set Axiom) Given a set A, there is a set Q such that B C 
A implies that B € Q. 


Consider the statement ‘x is a subset of A.’ By the axiom of specification, we have 
a unique set given by 
{x € Q| x is a subset of A}. 


This set is independent of the choice of (2 in the power set axiom. We denote this 
set by 9 (A) and call it the power set of A. 

Since the empty set J is a subset of every set, so (A) can never be an empty set. 
What is go (J)? Since 0 C YB, W € — (VY). Suppose that A € (4). Then, A C GJ. But, 
then if x € A, thenx € %.Sincex € is acontradiction, x € A is also acontradiction. 
Hence, A = &. Thus, (0) = {@}. Further, A € go ({%}) if and only if A C {@}. 
This shows that A = @ or A = {@}. Thus, go ({O}) = {G, {O}}. Further, go ({, {O}}) = 
{O, {O}, {{0}}, {W, {}}}, and so on. 

The next axiom is the axiom of regularity (also called the axiom of foundation). It is 
used specially in discussions involving ordinal arithmetic. In axiomatic set theory, the 
members of sets are also sets. Indeed, any mathematical discussion can be modeled 
so that all the objects considered are sets of sets. For example, | can represented by 
{G}, 2 can be represented by {J, {4}}, and so on. The axiom is designed to restrict 
uncomfortable situations such as A € A, (A € Band B € A), and (A € Band Be 
C and C € A) in any course of discussion. 


Axiom 8 (Axiom of regularity) If A is a nonempty set of sets, then ‘there exists 
X(X €AandX(\A = 0) 


Thus, given a nonempty set A of sets, there is a set X in A such that no member 
of X isin A. 


Theorem 2.1.21 Let A be a set of sets. Then, A ¢ A. 


Proof Let A be a set. {A} 4 @. By the axiom of regularity, there exists X € {A} 
such that if x € X, thenx ¢ {A}. Now, X ¢€ {A} if and only if X = A. Thus, 
if x € A, thenx ¢ {A}. Since A € {A}, A GA. tt 


Theorem 2.1.22 Given sets A and B,A ¢ BorBé€A. 


Proof Suppose that A € B and B € A. Then, B € A, B € {A, B}, A € B, and also 
A € {A, B}. Thus, there is no X € {A, B} such that x € X implies that x € {A, B}. 
This contradicts the axiom of regularity. ft 


Let X be a set. The set Xt = X LJ{X} is called the successor of X. 
Proposition 2.1.23 Let X and Y be sets. Then, Xt = Y* if and only if X = Y. 


Proof If X = Y, then Xt = Y*. Suppose that ¥ #4 Y and Xt = Y*. Then, 
X U{X} = ¥ U{}. Since X € X U{X}, X € YU{Y}, and since X A Y,X EY. 
Similarly, Y € X. This is a contradiction (Theorem 2.1.22). t 
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A set S is called a successor set if 


(i) {@} € S, and 
(ii) X € S implies X* ¢€ S. 


The following axiom asserts that there is an infinite set. 


Axiom 9 (Axiom of infinity) There exists a successor set. 


Proposition 2.1.24 Let X be a set of successor sets. Then, (\s-y S is also a successor 
set. 


Proof Since each S is a successor set, {4} € S, for all S € X. Hence, {0} € (Vsey S. 
Let x € (\sex S. Then, x € S, for all S € X. Since each S € X is a successor set, 
xt €S, for all S € X. Hence, xt € ()s.¥S. tt 


Corollary 2.1.25 Let X be a successor set. Then X contains the smallest successor 
set contained in X. 


Proof The intersection of all successor sets contained in X is the smallest successor 
set contained in X. tt 


Corollary 2.1.26 Let X and Y be successor sets. Let A be the smallest successor set 
contained in X, and B the smallest successor set contained in Y. Then A = B. 


Proof X (\ Y is also a successor set. Thus, A and B are both smallest successor sets 
contained in X () Y. t 


Let X be a successor set. The smallest successor set contained in X, which is 
the smallest successor set contained in any other successor set, is called the set of 
natural numbers. The set of natural numbers is denoted by N. {@} is denoted by 1, 
and it is called one. {G}* = {@, {G}} is denoted by 2, and it is called two, and so 
on. The properties of the set N of natural numbers can be faithfully described in the 
form of Peano’s axioms as given below: 


Peano’s Axiom 


Py.le N. 

P,. For allx € N, xt EN. 

P3.xt =y* if and only if x = y. 

Py. For allx EN, 1 A xt. 

Ps. If M is a set such that 1 € M and x* € M for allx € M()\N, then N C M. 
Further properties of the natural number system N will be discussed in detail in the 
next chapter. 


Exercises 
2.1.1 Show that 


(i) ANB=B 
(ii) AUB=A 
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(iii) A-®=A 

(iv) B-A=9%. 

2.1.2 Show that A — (A — B) =A()B. 

2.1.3 Show that A — (A()B) =A-—B. 

2.1.4 Show that A) B = A if and only if BCA. 

2.1.5 Show that (A()B) UC = AUC) NBUO. 

2.1.6 Show that (A()B) UC =A(\(BU ©) if and only if CCA. 
2.1.7 Show that A C B implies CU AC CUB. 

2.1.8 Show that (A4—B)-C =(A—C)-B. 

2.1.9 Show that 


(Gi) AQ(BUA) =A. 
Gi) A=AU(BNA). 


2.1.10 Put A @ B = (A — B) \(B — A). Show that 


i) ADB OBC=AGBOC). 

(i) ADD=A=D@A. 

(iii) ADB=BOA. 

(iv) APB B=G if and only if A= B. 

(v) ANBOC = ANB@ANO. 
(vi) ABC =BQC if and only if A= B. 


2.1.11 A Cc Bif and only if — (A) C p(B). 
2.1.12 Show that 9 (A () B) = (A) (1) 9 (B). 


2.1.13 Show that 9 (A) U @(B) © »(A UB). Show by means of an example that 
equality need not hold. 


2.1.14 Suppose that A contains n elements. Show that so (A) contains 2” elements. 
2.1.15 Can go (A) be 4? Support. 

2.1.16 Show that a union of successor sets is a successor set. 

2.1.17 Let A be a successor set. Can 9 (A) be a successor set? support. 

2.1.18 Let A and B be successor sets. Can A — B be a successor set? Support. 
2.1.19 Show that Xt # X for every set X. 

2.1.20 (Xt)* #X for every set X. 


2.1.21 Show that the empty set is not successor of any set. 
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2.2 Cartesian Product and Relations 


Let X be a set. Leta, b € X. Then, the set {{a}, {a, b}} is a subset of go (X). We denote 
the set {{a}, {a, b}} by (a, b) and call it an ordered pair. Thus, (a, b) € (9 (g0(X)). 


Proposition 2.2.1 (a, b) = (b, a) if and only if a= b. 


Proof Suppose that (a,b) = (b,a). Then, {{a}, {a,b}} = {{b}, {b, a}}. Since 
{a, b} = {b, a}, {a} = {b}. Hence, a = b. Clearly, a = b implies (a, b) = (a, a) = 
(b, a). t 


Observe that (a, a) = {{a}, {a, a}} = {{a}, {a}} = {{a}}. 
Let X and Y be sets. Then, the set 


XxY={(a,b)|aeXandbe Y} 


is called the cartesian product of X and Y. Clearly, X x Y C p(g(X U Y)). 
Proposition 2.2.2 Let A, B, and C be sets. Then, 


(i) (AUB) x C=(Ax C)UBx C). 
(ii) (ANB) x C=(Ax C)I\(Bx C). 
(iii) (A—B)x C=(Ax C)— (Bx C). 


Proof (i). Let (x, y) € (AUB) x C. By the definition, ‘x ¢€ AU) Band y € C,’ 
This implies that ‘(x € A and y € C) or (x € Band y € C).’ Thus, ‘(x,y) € 
(A x C) or (x,y) € (B x C).’ By the definition, (x,y) € (A x C)U(B x C). 
It follows that (AL) B) x C C (Ax C)U(B x C).’ Similarly, it follows that 
‘(Ax C)U(Bx C) © (AUB) x C.’ By the axiom of extension, (A J B) x C = 
(Ax C)UBx C). 

Similarly, we can prove (ii) and (iii). tt 


Proposition 2.2.3 A x B = Wif and only if (A = Yor B= 9). 


Proof Suppose that A = @, and (x, y) € A x B. Then, x € J and y € B. Since x € G 
is acontradiction, (x, y) € @ x Bis also a contradiction. Hence, 4 x B = %. Similarly, 
A x J = %. Now, suppose that A ~ J and B # WY. Then, there is an element x € A 
and an element y € B. In turn, (x, y) € A x B. Hence, Ax BAY. tt 


Relations 


Consider the relation ‘is father of.’ Nehru is father of Indira, and Feroze Gandhi is 
the father of Rajeev Gandhi. This gives us pairs (Nehru, Indira) and (Feroze Gandhi, 
Rajeev Gandhi). If we look at the set R of all pairs (a, b), where a is father of b, then 
the set R faithfully describes the relation of ‘is father of.’ One is genuinely tempted 
to define a relation as a set of ordered pairs. 


Definition 2.2.4 A subset R of X x X is called a relation on X. If (x, y) € R, then 
we Say that x is related to y under the relation R. We also express it by writing xRy. 
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Example 2.2.5 § is a relation on X in which no pair of elements in X are related. 
X x X is the largest (universal) relation on X in which each pair of elements in X is 
related. 


Example 2.2.6 A = {(x,x) | x € X} is a relation on X called the diagonal relation 
on X. This is the most selfish relation on X. 


Example 2.2.7 Let X = {a,b,c}. R = {(a, b), (b, a), (a, c)} is a relation on X. 


Example 2.2.8 Let X be a set. Then, R = {(a, b) | a,b € X and a € b} is a relation 
on X. 


Example 2.2.9 Let X be a set. Then, R = {(A, B) | A,B € —(X) and A C Bhisa 
relation on £9 (X). 


Let R and S be relations on X. Then, R LU) S, R (] S, and R—S are all subsets of X x X, 
and hence, they are also relations on X. 


Definition 2.2.10 Let R and S be relations on X. The relation 
RoS = {(x,z) € X x X | (x, y) € Sand (y, z) € R for some y € X} 
is called the composition of R and S. 
Proposition 2.2.11 Let R, S, and T be relations on X. Then, 
(RoS)oT = Ro(SoT). 
Proof Let (x, y) € (RoS)oT. By the definition, 
there exists z € X such that (x, z) € T, and (z, y) € RoS. 
Again, by the definition, 
there exist z and u € X such that (x, z) € T, (z, u) € S, and (u, y) € R. 


Thus, 
there exists u € X such that (x, u) € SoT, and (u, y) € R. 


Hence, (x,y) € Ro(SoT). This shows that (RoS)oT C Ro(SoT). Similarly, 
Ro(SoT) © (RoS)oT. By the axiom of extension, the result follows. t 


Proposition 2.2.12 RoA = R = AoR. 


Proof Since (x, x) € A for all x € X, (x, y) € RoA if and only if (x, y) € R. This 
proves that RoA = R. Similarly, R = AoR. tt 
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Proposition 2.2.13 Let R, S and T be relations on X. Then 


(i) Ro(S UT) = (RoS) U(RoP) 
(ii) Ro(S()\T) © (RoS) (\(RoT) 
(iii) (RJ S)oT = (RoT) U(SoT) 
(iv) (R(\ S)oT C (RoT) (\(SoT) 


Proof (i) Let (x, y) € Ro(S J T). By the definition, 
there exists z € X such that (x,z) € S$ U T, and (z,y) ER. 
Thus, 
there exists z € X such that ((x, z) € S, and (z, y) € R) or (x, z) € T, and (z, y) € R). 


In turn, it follows that ‘(x,y) € (RoS) or (x,y) € (RoT).’ Hence, (x,y) € 
(RoS) ()(RoT). This shows that Ro(S J T) © (RoS) LJ(RoT). Similarly, (RoS) U 
(RoT) C Ro(S (J T). By the axiom of extension, Ro(S J T) = (RoS) U)(RoT). 

Similarly, we can prove the rest. ft 


Example 2.2.14 Let X = {a, b,c}. Let R = {(a, b), (a, c)} and S = {(b, c), (b, b)}. 
Then RoS = %, and SoR = {(a, c), (a, b)} = R(verify). Thus, RoS need not be SoR. 
Observe that R = SoR = AoR, and S # A. If we take T = {(a, a), (b,c), (b, b)}, 
then RoT = {(a,c), (a, b)} = R and ToR = R. But T ~ A. Thus, RoT = R = ToR 
need not imply that T = A. 


Definition 2.2.15 Let R be a relation on X. Then, the relation 
R" ={(@,y) €X x X | (y, x) € R} 


is called the inverse of R. 


Example 2.2.16 LetR = {(a,b), (a,c)} be a relation on the set X = {a,b,c}. 
Then, R~! = {(b, a), (c, a)}. Now, RoR™! = {(b, b), (c, c)}, and R7'oR = {(a, a)}. 
Thus, here again, RoR“! # RoR. 


Proposition 2.2.17 Let R and S be relations on X. Then, 
OG) =k 
(ii) (RoS)~! = S~'oRT!. 


Proof Clearly, (x, y) € R if and only if (y,x) € R7!. Also, (y,x) € R7! if and 
only if (x,y) € (R7')7!. Thus, R = (R7!)~!. To prove (ii), let (x, y) € (RoS)~!. 
Then, (y, x) € RoS. Hence, there exists z € X such that (y, z) € S and (z,x) € R. 
Thus, (x,z) € R7!, and (z,y) € S~' for some z € X. But, then (x, y) € S-'oR“!, 
This shows that (RoS)~! C S~'oR7!. Similarly, S~'oR7! C (RoS)7!. t 
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Types of Relations 


Definition 2.2.18 A relation R on X is said to be 

(i) a reflexive relation if (x, x) € R for all x € X, or equivalently if A C R. 

(ii) a symmetric relation if (x, y) € R implies that (y, x) € R, or equivalently if 
| 

(iii) an antisymmetric relation if (x, y) € R and (y, x) € R implies that x = y, or 
equivalently if R()R7! C A. 

(iv) a transitive relation if when ever (x, y) € R and (y,z) € R, (x,z) € R, or 
equivalently if RoR C R. 


Example 2.2.19 Let X = {a, b, c} and 
R= {(a, a), (b, b), (c, c), (a, b), (b,c), (c, b)}. 


Then, R is reflexive but none of the rest of the three. 


Example 2.2.20 Let X = {a,b,c} and R = {(a, b), (b, a)}. Then, R is symmetric 
but none of the rest of the three. 


Example 2.2.21 LetX = {a, b,c}and R = {(c, b), (a, c)}. Then, R is antisymmetric 
but none of the rest of the three. 


Example 2.2.22 Let X = {a, b, c} and 
R = {(a, b), (b, a), (a, a), (b, b), (a,c), (b, c)}. 

Then, R is transitive but none of the rest of the three. 

Example 2.2.23 Let X = {a, b, c} and 

R= {(a, a), (b, b), (c, c), (a, b), (6, a), (, c), (c, BY}. 
Then, R is reflexive and symmetric but neither antisymmetric nor transitive. 
Example 2.2.24 Let X = {a, b, c} and 
R= {(,c), (c, b), (bb), (c, ©)}. 


Then, R is symmetric and transitive but neither reflexive nor antisymmetric. 


Proposition 2.2.25 Let R be a relation on X which is symmetric and transitive. 
Suppose that for all x € X, there exists y € X such that (x,y) € R. Then, R is 
reflexive. 


Proof Letx € X. Then, (x, y) € R for some y € X. Since R is symmetric, (y, x) € R. 
Since R is transitive, (x, x) € R. Thus, R is reflexive. tt 
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Example 2.2.26 The relation which is reflexive, symmetric, and antisymmetric is 
the diagonal relation. Thus, a reflexive, symmetric, and antisymmetric relations are 
also transitive. 


Exercises 


2.2.1 Suppose thatA x CC Bx C,C 4. Show that A C B. 


2.2.2 Show that (A x B = B x A) if and only if (A=@orB=@orA=B). 


2.2.3 Suppose that A, B, and C are nonempty sets. Is 
(A x B) x C=A x (B x C)? Support. 


2.2.4 Show that (A() B) x (C()D) = (Ax C)(\(B x D). 


2.2.5" Suppose that A C A x A. Show that A = . 
Hint. Use the axiom of regularity. 


2.2.6" Suppose that A = A x B. Show that A = @. 


2.2.7 Suppose that A contains n elements and B contains m elements. Show that 
A x B contains n - m elements. 


2.2.8 Show that the number of relations on a set containing n elements is Qn 


2.2.9 Let X = {a,b,c}, R = {(a, b), (b,c), (c,a)} and S = {(a,a), (a,c), (b, b)}. 
Find out (i) R US, (ii) RS, (iii) RoS, and (iv) R-!. 


2.2.10 Show by means of an example that equality in Proposition 2.2.13 (ii) and (iv) 
need not hold. 


2.2.11 Find out the number of reflexive relations on a set containing n elements. 
Hint. A reflexive relation on X x X can be written as A U S,whereS CX xX—A. 


2.2.12 Find out the number of symmetric relations on a set containing n elements. 


2.2.13 Find out the number of antisymmetric relations ona set containing n elements. 


2.3 Equivalence Relation 


The concept of equality in mathematics is best described in terms of equivalence 
relations. 


Definition 2.3.1 A relation R on X which is reflexive, symmetric, and transitive is 
called an equivalence relation on X. 


Example 2.3.2. The diagonal relation A is the smallest equivalence relation on X. 
The universal relation X x X is the largest equivalence relation on X. The relation 
R = {(a, a), (b, b), (c, c), (a, b), (b, a)} is an equivalence relationon X = {a, b, c}. 
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Definition 2.3.3 Let R be an equivalence relation on X. Let x € X. The subset 
={yeX | (@,y) € R} 


is called the equivalence class of X modulo R determined by the element x. 


Thus, for example, the equivalence class A, of X modulo A determined by x is 
the singleton {x}. For the equivalence relation 
R = {(a,a), (b, b), (c, c), (a, b), (b, a)} on X = {a, b, c}, the equivalence classes 
are R, = {a,b} = R,andR,. = {c}. 

Since R is reflexive, (x, x) € R for all x € X, and hence, x € R, for all x € X. 


Proposition 2.3.4 Let R be an equivalence relation on X. Then, the following hold. 


(i) x € Ry forallx €X. 
(ti) Ry = R, if and only if (x,y) € R. 
(iii) Ry # Ry if and only if Ry () Ry = 9%. 


Proof (i) Since R is reflexive, (x, x) € R for all x € X, and hence x € R, for all 
xEeXx. 

(ii) Suppose that R, = Ry. Since R is an equivalence relation, y € R, = R,. Hence 
(x, y) € R. Conversely, suppose that (x, y) € R. Since R is symmetric, (y, x) € R. 
Let z € R,. Then, (x, z) € R. Since R is transitive, (y, z) € R. Thus, z € Ry. Hence, 
R, C Ry. Similarly, Ry C R,. This shows that R, = Ry. 

(iii) Suppose that R, (| Ry 4B. Letz € Ry (1) Ry. Then, (x, z) € Rand (y,z) ER. 
Since R is symmetric and transitive, (x, y) € R. It follows from (ii) that Ry = Ry. 
Clearly, if Ry (| Ry = Y, then R, A Ry, for x € Ry. tt 


Let X be a non emptyset. A set 9 of nonempty subsets of X is called a partition of 
X if the following hold. 


(i) Union of members of £9 is X, i.e., UsegA =X. 
(ii) If A and B are distinct members of g, then A () B = 9%. 


Corollary 2.3.5 Let R be an equivalence relation on X. Then, gor = {Rx | x € X} is 
a partition of X. 


Proof Follows from the above proposition. ft 


The partition fz is the partition determined by the equivalence relation R. The set 
@r is also denoted by X/R, and it is also called the quotient set of X modulo R. 


Proposition 2.3.6 Let s be a partition of X. Define a relation R° on X by R° = 
eee A x A. Then R® is an equivalences relation such that fogs = $9. 
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Proof Since union of members of go is X, givenx € X,x € AforsomeA € sg. Hence 
(x,x) € R® for all x € X. Thus, R® is reflexive. Suppose that (x, y) € R®. Then, 
there is an element A € g such that x, y € A, and so y,x € A. Hence, (y, x) € R®. 
Thus, R® is symmetric. Suppose that (x, y) € R® and (y, z) € R®. Then, there is 
an element A € go and an element B € g such that x,y € A and y,z € B. Since 
y€A()B,A()\B FW. Further, since is a partition, A = B. Hence, x,z EAE po. 
Thus, (x, z) € R®. This shows that R® is transitive. 

Next, R® is the member A of 9 such that x € A. Hence, ogo = g. tt 


Proposition 2.3.7 R®® = R for every equivalence relation R. 


Proof Suppose that (x, y) € R. Then x, y € Ry € sor. Hence (x, y) € R®*. Suppose 
that (x, y) € R®*®. Then there exists R, € gor such that x, y € R,. Hence, there is 
an element z € X such that (x,z) € R and (y, z) € R. Since R is symmetric and 
transitive, (x, y) € R. This shows that R = R®®. tt 


Remark 2.3.8 It is apparent from the above discussions that every partition can be 
realized faithfully as an equivalence relation, and every equivalence relation can be 
realized faithfully as a partition. 


Example 2.3.9 Let R be a relation (not necessarily equivalence) on X. Define Ry = 
{y € X | (x, y) € R}. Suppose that = {R, | x € X} is a partition of X. Can we 
infer that R is an equivalence relation? No. For example, take X = {a,b,c},R = 
{(a, b), (b, c), (c, a)}. Then, Ra = {b}, Rp, = {c}, Re = {a}. Thus, (Ra, Rp, Re} is a 
partition of X, whereas R is not an equivalence relation (it is neither reflexive nor 
symmetric nor transitive). 


Example 2.3.10 Let go © g(X) (not necessarily a partition). Consider the relation 
R® on X given by R° = {(x,y) | such that x,y € A for some A € 6}. Suppose 
that R® is an equivalence relation. Can we infer that 49 is a partition? Again, no. 
For example, take go = {{a, b}, {b, c}, {c, a}} C fo (X), where X = {a, b, c}. Then, 
R® = X x X is an equivalence relation. 


Exercises 


2.3.1 Let R and S be equivalence relations on X. Show that RoS is an equivalence 
relation if and only if RoS = SoR. 


2.3.2 Let p, denote the number of equivalence relations on a set containing n ele- 
ments. Show that 
Pri = dro ("C,) pr 


Hint. p, is the number of partitions of a set containing n elements. 


2.3.3 Let X = {a, b, c, d} and 
R = {(a, a), (b, b), (c, c), (d, d), (a, b), (b, c), (a,c), (b, a), (c, b), (c, a}. 


Show that R is an equivalence relation. Find soz. Can we find 
an other relation S such that fr = sos? Support. 
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2.3.4 Show that the intersection of symmetric relations is symmetric. Deduce that 
for every relation R on X, there is smallest symmetric relation containing R. This 
relation is called the symmetric closure of R. Find the symmetric closures of all the 
relations given above. 


2.3.5 Show that the intersection of transitive relations is transitive. Deduce that for 
every relation R on X, there is smallest transitive relation containing R. This relation 
is called the transitive closure of R. Find the transitive closures of all the relations 
given above. 


2.3.6 Show that the intersection of equivalence relations is equivalence relation. 
Deduce that for every relation R on X, there is smallest equivalence relation containing 
R. This relation is called the equivalence closure of R. Find the equivalence closures 
of all the relations given above. 


2.3.7 Is composite of two symmetric relations always symmetric? If not under what 
conditions it is symmetric. 


2.3.8 Is composite of two transitive relations always transitive? If not under what 
conditions it is transitive. 


2.4 Functions 


Let X and Y be sets. A subset f of X x Y (the Cartesian product) is called a function 
or a mapping (or a map) from X to Y if the following two conditions hold. 


(i) For all x € X, there exists y € Y such that (x, y) € f. 
Gi) If (x, yi) € f and (x, y2) € f, then y, = y2. 


X is called the domain, and Y is called the co-domain of f. If (x, y) € f, we write 
y = f(x) and call it the image of the element x € X under the map f. Thus, under 
this notation, f = {(x, f(x)) | x € X}. 

Intuitively, a function f from X to Y is an association or a correspondence which 
associates to each x € X, a unique y € Y which we denote by f(x). Thus, to define a 
map f from X to Y, it is sufficient to give a unique f(x) in Y for all x € X. Any two 
functions f and g from X to Y are equal if and only if f(x) = g(x) for allx € X. 

We also adopt the notation f : X¥ —~> Y to say that f is a map from X to Y. 

Let f be a map from X to Y and g be a map from Y to Z. Then, gof defined by 


gof = ((x,2) | (sy) €f and (y, 2) € g for some y € Y) 


is also a map from X to Z, and it is called the composite of f and g. Thus, the map 
gof from X to Z is given by (gof)(x) = g(f(x)) for all x € X. 

The subset A of X x X is also a map from X to X. This map is called the 
identity map on X, and it is denoted by Jy. Thus, /y(x) = x for all x € X. Clearly, 
folx =f = lyof for every map f from X to Y. 
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Let Y be a subset of X. Then, iy = {(), y) | y € Y} isa map from Y to X called the 
inclusion map from Y to X. This map is sometimes denoted by the symbol Y ~ X. 

Let f be a map from X to A, and Y a subset of X. The composition foiy is a map 
from Y to A, and it is called the restriction of f to Y. The map foiy is also denoted 
by f ly. 

Let X and Y be sets and y € Y. Then, X x {y} is a map f from X to Y such that 
f() = y for all x € X. This map is called a constant map. 

Let X and Y be sets. Consider the Cartesian product X x Y. The map p,; from 
X x Y to X defined by pi ((x, y)) = x is called the first projection and the map p2 
from X x Y to Y defined by p2((x, y)) = y is called the second projection map. 


Proposition 2.4.1 Let f be a map from X to Y, g a map from Y to Z, and h a map 
from Z to U. Then (hog)of = ho(gof). 


Proof Clearly, ((hog)of)(x) = (hog)(F(x)) = A(gf@))) = A(gef)(x)) = 
(ho(gof))(x) for all x € X. Hence ho(gof) = (hog)of. tt 


Let f be a map from X to Y. Then f C X x Y. Consider f~! = {(y, x) | (x, y) € f}. 
Then f~! C Y x X need not be a map from Y to X for two reasons: (i) for y € Y, there 
may not be any x € X such that (x, y) € f, and so there may not be any x € X such 
that (y, x) € f7!, (ii) Gy, x1) € fo! and (y, x2) € f7! need not imply that x; = x. 
Thus, f~! will be a map if and only if the following two conditions hold. 


(i) For ally € Y, there is an element x € X such that (x,y) € f. 
(ii) If (x1, y) € f and (x2, y) Ef, then x; = x2. 


A map f from X to Y is called a surjective map (also called an onto map) if 
for ally € Y, there is an element x € X such that (x, y) € f. Thus, f is a surjective 
map if for all y € Y, there is an element x € X such that f(x) = y. 

A map f from X to Y is called an injective map (also called a one — one map) 
if (x1, y) € f, (2, y) € f implies that x; = x2. Thus, f is injective map if whenever 
Sf) =f (2), x1 = x2. In other words, f is injective if whenever x, A x2, f(x1) 4 
f (2). 

A map f which is injective as well as surjective is called a bijective map (also 
called a one-one-onto map). 

Thus, f—! is a map if and only if f is bijective, and then, the map f~! is called the 
inverse of f. The inverse of a bijective map is also bijective. 


Example 2.4.2. An injective map need not be surjective. For example, take X = 
{a,b}, Y = {x, y, z}. Define a map f from X to Y by f(a) = x and f(b) = y. Then, 
f is injective but it is not surjective, for there is no element in X whose image is z. 


Example 2.4.3 A surjective map need not be injective. Take X = {a,b,c} and 
Y = {x, y}. Define a map f from X to Y by f(a) = x = f(b), f(c) = y. Then, f is 
surjective, but it is not injective. 
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Proposition 2.4.4 Let f be a bijective map from X to Y. Then, f—' is also a bijective 
map from Y to X. Also (i) (f~'!)~! =f, (ii) f~of = Ix, and fof~! = ly. 


Proof Let f be a bijective map. Then, we have already observed that f~! is a map 
from Y to X. Suppose that (y,,x) € f~! and (y2,x) € f~!. Then, (x, y;) € f and 
(x, y2) € f. Since f is a map, y; = y2. Thus, f! is injective. Let x € X, then 
(x, f(x)) € f, and hence, (f(x),x) € f~!. This shows that f~! is surjective. We 
also observe that (f~'of)(x) = f~'(f(x)) = x for all x € X, and (fof~')Q) = 
fG'0)) = y for all y € Y. Thus, f~!of = Ty, and fof~! = Iy. The fact that 
(f-!)~! =f follows from the definition of f~!. 


Proposition 2.4.5 (i) The composite of any two injective maps is an injective map, 
(ii) the composite of any two surjective maps is a surjective, and (iii) the composite 
of any two bijective maps is a bijective map. 


Proof (i) Let f be an injective map from X to Y and g be an injective map from Y to 
Z. Suppose (gof)(x1) = (gof)(x2). Then, g(f(x%1)) = g(f(x2)). Since g is injective, 
f(1) = f(@2). Further, since f is injective, x; = x2. Hence, gof is injective. 

(11) Suppose that f and g are surjective maps. Let z € Z. Since g is surjective, 
there exists an element y € Y such that g(y) = z. Again, since f is surjective, 
there exists an element x € X such that f(x) = y. But, then (gof)(x) = g(f(x)) = 
g(y) = z. Hence, gof is surjective. 

(iii) Follows from (i) and (ii). tt 


Proposition 2.4.6 Let f be a map from X to Y and g be a map from Y to Z. Then, 
the following hold. (i) If gof is surjective, then g is surjective. (ii) If gof is injective, 
then f is injective. 


Proof (i) Suppose that of gof is surjective. Let z € Z. Since gof is surjective, 

there exists x € X such that (gof)(x) = z, i.e., g(f(x)) = z. Hence, g is surjective. 
(ii) Suppose that gof is injective and f(x) = f(x). Then, g(f(x%1)) = g(f 2)), 

Le., (gof)(x1) = (gof) (x2). Since gof is injective, xj = x2. Hence, f is injective. { 


Corollary 2.4.7 If gof is bijective, then g is surjective and f is injective. tt 


Proposition 2.4.8 A map f from X to Y is injective if and only if it can be left 
canceled in the sense that if fog = foh, then g = h. A map f is surjective if and only 
if it can be right canceled in the sense that if gof = hof , then g = h. 


Proof Suppose that f is injective and fog = foh. Then, f(g(z)) = (fog)(z) = 
(foh)(z) = f(h(2)) for all z € Z. Since f is injective, g(z) = h(z) for all 
z € Z. This shows that g = _ h. Now, suppose that f is not injective. Then, 
there exist elements x1,xX2 € X such that x; # x2 and f(x1) = f(x). Take Z = 
{x1, X2}. Define a map g from Z to X by gv) = x; = g(x2) and a map hf from Z to 
X by h(x) = x2 = h(x2). Then, g 4 h but fog = foh. 
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Next, suppose that f is surjective and g, h are maps from Y to Z such that gof = 
hof. Then, g(f(x)) = h(f(x)) for all x € X. Since f is surjective, g(y) = h(y) for 
all y € Y. This shows that g = h. Now, suppose that f is not surjective. Then, 
there exists an element yo € Y such that yo € f(x) for all x € X. Take Z = {a, b}. 
Define a map g from Y to Z by g(vo) = a, gy) = b for all y ¥ yo, and a map hh from 
Y to Z by h(y) = b for all y € Y. Clearly, then, g 4 h and gof = hof. tt 


Corollary 2.4.9 A map f from X to Y is bijective if and only if it can be canceled 
from left as well as from right. tt 


Proposition 2.4.10 Let f be a map from X to Y. Then f is bijective if and only if 
there exists a map g from Y to X such that gof = Ix and fog = Ty. Further, then 
g=fo'. 

Proof If f is bijective, then f—'of = Iy and fof! = Iy (Proposition 2.4.4). Let 
g be a map from Y to X such that gof = ly and fog = Iy. Since gof = Ix is 
injective, f is injective. Since fog = Ty is surjective, f is surjective. Further, then 
f-'of =Ix = gof,and fof~' = ly = fog. The result follows from the above 
corollary. tt 


Corollary 2.4.11 Let f be a bijective map from X to Y, and g be a bijective map 
from Y to Z. Then (gof)~' = f~'og™'. 


Proof Clearly, 


(f-'og™')o(gof) = (f'o(g'og))of =f~'of = Ix. 


Similarly, 
(gof )o(f-'og7') = Iy. 


The result follows. tt 
Proposition 2.4.12 There is no surjective map from any set X to its power set so (X). 


Proof Let f be a map from X to #9 (X). Consider the set A = {x € X | x ¢ f(x)}. 
Then, A € g(X). Suppose that f(y) = A for some y € X. If y ¢ A = f(y), then 
yeA.IfyeA=f(y), then y ¢ f(y) =A. Hence, the supposition that f(y) = A for 
some y € X is false. This shows that f can not be surjective. tt 


Let X and Y be sets. The set of all maps from X to Y is denoted by ¥~*. 
What are X” and 6*? 


Example 2.4.13 Let X be a set, and 2 denotes the set {0,1}. Define a map ¢ from 
0 (X) to 2* by o(A)(x) = Oif x ¢ A and #(A)(x) = 1 if x € A. Check that the map 
¢ is bijective. 

Let f be a map from X to Y. Let A C X and B C Y. The subset f(A) = {f(a) | 
a € A} of Y is called the image of A under the map f. The subset f~!(B) = {x € X | 


f(x) € B} of X is called the inverse image of B under f. 
What are f~'(Y) and f—'(@)? To say that f is surjective is to say that f(X) = Y. 
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Proposition 2.4.14 Let f be a map from X to Y and A C X. Then A C f—!(f(A)). 
Also A = f—'(f(A)) for all A C X if and only if f is injective. 


Proof Let a € A. Then, f(a) € f(A), and hence, by the definition, a € f~'(f(A)). 
Thus, A C f—!(f(A)). Suppose that f is injective. Let x <¢ f~'(f(A)). Then, f(x) € 
F(A) (by def). Hence, there exists an element a € A such that f(x) = f(a). Since f is 
injective, x = a € A. Thus, f~!(f(A)) C A, and therefore, A = f—'(f(A)). Suppose 
that f is not injective. Then, there exist elements x,,x2. € X, x; 4% X2 such that 
(x1) =f 2) = y (say). Take A = {x1}. Then, f(A) = {y} and {x1, x2} C f~1(F(A)). 
Hence, A 4 f—!(f(A)). tt 


Proposition 2.4.15 Let f be a map from X to Y and B C Y. Then f (f~'(B)) C B. 
Also B = f (f~'(B)) for all B C Y if and only if f is surjective. 


Proof Let y € f(f~'(B)). Then, y = f(x) for some x € f~'(B). But then y = 
f(x) € B. Hence, f(f~'(B)) © B. Suppose that f is surjective and y € B. Then, 
there exists an element x € X such that f (x) = y. Clearly, x € f~'(B), and hence, y = 
f(x) € f(f—-'(B)). Therefore, B = f(f~'(B)). Suppose now that f is not surjective. 
Then, there exists an element b € Y such that b ¢ f (X). But, then f—'({b}) = @, and 
hence, f(f~!({b})) = 0 # {b}. t 


Proposition 2.4.16 Letf be amap from X to Y. Let A, and Az be subsets of X. Then, 
the following hold. 


(i) f(A; UA2) = f(A1) Uf 2). 
(ii) f(Ai (Az) S f(A) (1 f 2). 


Further, in (ii), equality holds for every pair of subsets A, and Az of X if and only if 
f is injective. 


Proof The proof of (i) and (ii) is left as exercises. We prove the last assertion. Suppose 
now that f is injective. Let y € f(A1) ()f(A2). Then, there is an element a € A; and 
an element b € A) such that y = f(a) = f(b). Since f is injective, a = b € A; (| Ao, 
and so y = f(a) € f(Ai ()A2). Thus, f(A1) (\f(A2) © f(A1 () A2). But already 
(from (ii)) f (Ai 1) A2) C f (Ar) () f (Az). Thus, equality holds in (ii) if f is injective. 
Conversely, suppose that f is not injective. Then, we have two distinct elements 
X1,X2 in X such that f(x1) = f(z) = b (say). Take Ay = {x1}, Az = {x2}. Then, 
f(Ai (Az) =f @) = G, whereas f (A1) (Vf (Aa) = (b} #9. t 


Proposition 2.4.17 Letf be a map from X to Y. Let B, and Bz be subsets of Y. Then, 
the following hold. 


(i) f~'(Bi (1) Bo) = f7' (Bi) VF (Ba). 

(ii) f-' (Bi U Br) =f" (B1) Uf >). 

(iii) f~' (By, — By) = f~'(B1) — f' (Ba). 

Proof (i) Let x € f~'(B, () Bz). By the definition, f(x) € By () Bz. Thus, f(x) € 
B, and f(x) € Bo. This implies that x €¢ f~'(B,) and x € f—!(B2). In turn, x € 
f-' (By) (\f7!(B2). This shows that f~!(B; (1) Bo) © f-'(B1) 1) f~| (Bo). Similarly, 
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f (BY) A f7'(2) © f-'(Bi () B2). This proves (i). Similarly, we can prove the 
rest of the two. tt 


Family of sets. 

Let J be a set and X be a set of sets. A surjective map A from J to X is called a 
family of sets. We denote the image A(q@) of a by Aq. This family of sets is denoted 
by {Aq | a € [}. The set / is called the indexing set of the family. 

Let {A, | a € I} be a family of sets. Then, the set 


[Ao = {x | x € Ag for some a € T} 


ael 


is called the union of the family, and 


(Ao = {| x € Ay for all a € 1} 


ael 
is called the intersection of the family. 


Proposition 2.4.18 (De Morgan’s Law) Let X be a set and {Aq | a € I} be a family 
of sets. Then, {X — Aq | a € I} is another family of sets and 


(i) X ~ (Uae) = Naver X — Aq). 
(ti) X ~ (Maer Aa) = Uoer(X — Aq). 


The proof of the above proposition is left as an exercise. 

Let {X;, i € {1,2}} = {X1, Xo} be a family of sets containing only two sets 
X, and Xz. An element (x1, x2) of the Cartesian product X; x X2 can be faithfully 
realized as a map x from {1, 2} to X; U X2 with x(1) = x; andx(2) = x. This 
prompts us to define the Cartesian product of an arbitrary family as follows: 


Definition 2.4.19 Let {X, | a € J} be a family of sets. Let [],,-; Xq denote the set 
of all maps x from J to L),,<; Xa with the property that x(a) € X, for all a € I. The 
set [],<; Xo is called the Cartesian product of the family. 

Further, for each ag € J, the map po, from [],<; Xa to Xa, defined by pay (x) = x(a0) 
is called the a‘ projection map. 


ael 


The Axioms 1-9 constitute the Zermelo—Fraenkel (ZF) axiomatic system for set 
theory. 

Consider the set X of countries in the world. How to select a unique city in each 
country? More explicitly, how to get a map c from the set X to the set of all cities 
in the world so that c(A) € A for all countries A in X. Here, we can give a rule to 
define the map c by saying that c(A) is the capital of the country A. In general, if 
{X, | a € I} isanonempty family of nonempty sets, how to chose a unique member 
from each class. The following is an other fundamental and important axiom of set 
theory which ensures the existence of such a map. 
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Axiom 10 (Axiom of Choice) Let {X, | a € I} be a nonempty family of nonempty 
sets (i.e., / is nonempty, and X, 4 Y for all a € 1). Then, [],-; Xa is nonempty set. 
More explicitly, there exists a map c from I to J,e; Xa (called a choice function) 
such that c(a) € Xq for alla € I. 


ael 


Remark 2.4.20 K. Godel in 1932 proved that the axiom of choice is consistent with 
the ZF axiomatic system. More explicitly, the negation of the axiom of choice is 
not a theorem in the ZF axiomatic system. Later, P. Cohn established that the axiom 
of choice is not a theorem in ZF axiomatic system. In turn, the axiom of choice is 
independent of the ZF axiomatic system. It also follows that the ZF axiomatic system 
is incomplete. The Axioms 1-10 constitute ZFC axiomatic system. The axiomatic 
system ZFC is also incomplete. Consider the following hypothesis: ‘If there is an 
injective map from N to X, and there is an injective map from X to 2, then there is a 
bijective from N to X, or else there is a bijective map from X to 2.’ This hypothesis is 
called the continuum hypothesis (CH). Godel and Cohen proved that the continuum 
hopothesis is independent of the ZFC axiomatic system. The Whitehead problem in 
group theory asks: ‘Is every abelian group A with EXT!(A, Z) = {0} a free abelian 
group?’ The Whitehead problem is also an undecidable proposition in ZFC. 


Letf bea map from X to Y and g amap from Z to U. Then, the map/f x g from X x Z 
to Y x U defined by (f x g)((x, z)) = (f@), g(z)) is called the Cartesian product 
of the map f with the map g. Clearly, products of injective maps are injective maps, 
and those of surjective maps are surjective. 

Let f be a map from X to Y and S an equivalence relation on Y. Then, (f x f)~!(S) 
is an equivalence relation on X (verify). Let R be an equivalence relation on X. Then, 
(f x f)(R) need not be an equivalence relation on Y even if f is surjective (give an 
example to support this). 

The equivalence relation (f x f)~'(A) on X is called the kernel of f, and it is 
denoted by kerf. It follows from the definitions that f is injective if and only if 
kerf = A (the diagonal relation on X). 


Proposition 2.4.21 Letf be a surjective map from X to Y. Let R be an equivalence 
relation on X containing the kernel of f. Then (f x f)(R) is an equivalence relation 
on Y such that (f x f)"'(f x f)(R)) = R. 


Proof Clearly, (f x f)(R) is symmetric. Since f is surjective, (f x f)(R) is also 
reflexive. We prove that it is transitive also. Let (u, v), (v, w) € (f x f)(R). Then, 
there exist (x, y), (z, t) € Rsuch that (f(x), f(y)) = (u, v) and (f(z), f@) = (v, w). 
This shows that f(y) = f(z) = v. Hence, (y, z) € (f xf) !(A) = kerf CR. Since R 
is transitive, (x, t) € R. But, then (u, w) = (f(x), f@®) € F& xf)(R). Thus, (f xf) (R) 
is an equivalence relation. Finally, we show that (f x f)~'((f x f)(R)) = R. Clearly, 
RCO xf) Uf x f)(R)). Let (x,y) € (f x fy TF x f)(R)). Then, (f(x), f()) € 
(f xf)(R). Hence, there exists (z, t) € Rsuch that (f(x), f(y) = (F(z), f()). But then 
f(x) =f (z) and f(y) = f(t). This shows that (x, z) and (y, t) belong to (f x f)~!(A). 
Since (f x f)~!(A) is supposed to be contained in R, (x, z), (y, f) and (z, f) are all 
in R. Since R is an equivalence relation, (x, y) € R. This completes the proof. tt 
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Corollary 2.4.22 (Correspondence Theorem) Let f be a surjective map from X to 
Y. Let R(X) denote the set of all equivalence relations on X containing kerf and 
R(Y) the set of all equivalence relations on Y. Then, f induces a bijective map 
f from R(X) to R(Y) defined by f (R) = (f x f)(R). 


Proof From the above proposition, it follows that (f xf)(R) € R(Y)for all R € R(X). 
Thus, f is a map from R(X) to R(Y). Since f is surjective, f x f is also surjective. 
Hence, (f xf)((f xf)! (S)) = S forall S € R(Y). This shows that f is surjective (note 
that (f x f)~!(S) € R(X)). Further, suppose that f(R;) = f(R2). Then, (f x f)(R1) = 
(f xf) (R2). Since Ry and R2 are equivalence relations containing ker f, it follows from 
the above proposition that R; = (f xf) '(f xf)(R1)) = F xf)! (fF xf) (Ro) = Ro. 
This proves that f is injective. tt 


Let X be a set and R be an equivalence relation on X. Consider the quotient set 
X/R = {R, | x € X}. The map v from X to X/R defined by v(x) = R, is called the 
quotient map. Clearly, v is surjective and (v x v)~'(A) = {(v, y) | Ry = v(x) = 
v(y) = Ry} = R. Thus, every equivalence relation is kernel of a map. We shall 
show that if f is a surjective map from X to Y, then Y can be realized as a quotient 
set through a bijective map. 


Theorem 2.4.23 Let f be a surjective map from X to Y. Let R be an equivalence 
relation on X containing ker f. Let S = (f x f)(R). Then, there is a bijective map f 
from X/R to Y/S such that the diagram 


a rr 


XR OVS 


is commutative. 


Proof Suppose that Ry, = R,,. Then, (x1,x2) € R, and so (f(x), f(2)) € (Ff x 
f)(R) = S. Hence, Sy.) = Seo). This shows that we have a map f from X/R to 
Y/S defined by f(R,) = Sfx) Further, since f is surjective, every member of Y/R 
is of the form Spi) = f (Ry). This shows that f is surjective. Suppose that f (R,,) — 
7). Then, Sy(x,) = Sf(x)- This means that (f(x), f@2)) ¢ S = (f x f)(R). In 
turn, (x1,.x2) € (f x f)~'(f x f)(R)). From the Proposition 2.4.21, it follows that 
(x1, x2) € R. This means that R,;, = R,,, and sof is also injective. The commutativity 
of the diagram is evident. tt 
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Corollary 2.4.24 (Fundamental Theorem of Maps) Let f be a surjective map from 
X to Y, andR = (f x f)~'(A) = kerf. Then there is a bijective map from X/R to 
Y such that pov = f. 


Proof Clearly, (f x f)((f x f)~'(A)) = A. Take S = A in the above theorem. 
One also observes that the quotient map v from Y to Y/A is bijective map given by 
v(y) = fy}. Take @ = v~!of. The result follows from the above theorem. tt 


Exercises 


2.4.1 Let X bea finite set containing n elements and Y be a set containing m elements. 
Suppose that n < m. Find the number of injective maps from X to Y. What happens 
ifm <n? 


2.4.2 Find the number of surjective maps from a set containing n elements to a set 
containing m elements. 


2.4.3, Let X be a set. Show that there is no injective map from P(X) to X. 


2.4.4 Let X” denote the set of all maps Y to X. Suppose that X 4 @. Show that there 
is a surjective map from Y to X” if and only if X is a singleton set. 


2.4.5 LetX, Y, and Z be sets. Show that there is a bijective map from X"*7 to (X")7. 


2.4.6 Let R and S be two equivalence relations on a set X such that R C S. Show 
that there is a bijective map ¢ : X/S —> (X/R)/(v x v)(S) such that the diagram 
formed by quotient maps is commutative. 


2.4.7 Let {Xq | a € J} be a family of nonempty sets. Show that each projection map 
is a surjective map. 
Hint. Use the axiom of choice. 


2.4.8 Let f : X —> Y bea surjective map. Show that there is an injective map ¢ 
from Y to X such that fot = Iy. 
Hint. Use the axiom of choice. 


2.4.9 Let f : X —> Y be an injective map. Show that there is a surjective map 
s: Y —> X such that sof = ly. 


2.4.10 Let X be a nonempty set. Show that the following conditions on X are equiv- 
alent: 


(i) Every injective map from X to X is surjective. 
(ii) Every surjective map from X to X is injective. 
(iii) Every injective map from X to X is bijective. 


Hint. Use the Exercises 2.4.8 and 2.4.9. 
A set satisfying the condition in Exercise 2.4.10 is called a finite set. A set which is 
not finite is called an infinite set. 
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2.4.11 Show that every subset of a finite set is finite, and every set containing an 
infinite set is infinite. 


2.4.12 Show that the union of two finite sets is finite. 


2.4.13 Show that every successor set is infinite (This justifies the name ‘Axiom of 
Infinity’ for the existence of successor set). 

Hint. If X is a successor set, then the map x <— x* is an injective map from X | ){H} 
to itself which is not surjective. 


2.4.14 Show that f(A) — f(B) C f(A — B). Show further that the equality holds 
provided that f is injective. 


2.5 Partial Order 


Let X be a set. A relation R on X is called a partial order if it is reflexive, anti- 
symmetric, and transitive. Usually, a partial order is denoted by ‘<.’ A pair (X, <), 
where < is a partial order on X, is called a partially ordered set. 


Example 2.5.1 Let Y be a set and X = g(Y). Then, the relation {(A, B) | A C B} 
is a partial order, and it is called the inclusion relation on X. We denote this relation 
also by C. Thus, (X, C) is a partial ordered set. Note that the inverse of a partial 
order is also a partial order. Thus, > is also a partial order on X. 


Example 2.5.2 Let X = {a, b,c, d}. Then, 
R = {(a, a), (b, b), (c, c), (d, 4), (a, b), (c, d)} 


is a partial order on X. 


Let (X, <) be a partially ordered set and Y be a subset of X. Then, the induced 
relation on Y is also a partial order on Y which is denoted by <y. 

A partial order < on X is called a total order if given x, yin X,x < yory < x. 
Example 2.5.2 is not a total order. Example 2.5.1 is a total order if and only if Y is 
singleton (prove it). 


Example 2.5.3 Let X = {a, b,c, d} and 
R= {(a, a), (b, b), (c, c), (d, d), (a, b), (b, c), (a,c), (c, d), (a, d), (b, d)}. 


Then, R is a total order on X 


Let (X, <) bea partially ordered set. A subset Y of X is called a chain in X if the 
induced partial order on Y is a total order on Y. 


Example 2.5.4 Let Y = {a,b,c} and X = g(Y). Then, the inclusion relation is a 
partial order on X. The subset Z = {@, {a}, {a, b}, {a, b, c}} is a chain in X. 
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Let (X, <) be a partially ordered set and A C X. An element x € X is called an 
upper bound (lower bound) of A if a < x(x < a) forall ainA. 


Remark 2.5.5. A subset of a partially ordered set need not have any upper bound 
(lower bound). It may have several upper bounds (lower bounds). Give examples to 
support it. 


Let (X <) be a partially ordered set. An element a € X is called a maximal 
(minimal) element if a < x(x < a) implies that x = a. 

In Example 2.5.2, b and d are maximal elements, whereas a and c are minimal 
elements of X. Thus, there may be so many maximal or minimal elements of a 
partially ordered set. There may not be any maximal or minimal elements (give 
examples to support it). 


Example 2.5.6 Let X = go(Y) — {Y,@}, where Y = {a,b,c}. Then, X is a par- 
tially ordered set with respect to inclusion. Clearly, {a, b}, {b, c}, {a, c} are maximal 
elements and {a}, {b}, {c} are minimal elements. 


Example 2.5.7 LetX = {a,b,c} and A = {(a, a), (b, b), (c, c)}. Then, A is a partial 
order on X such that each element is maximal and also each element is minimal. 


Example 2.5.8 Let Y be an infinite set and X be the set of all finite subsets of Y. 
Then, X is a partially ordered set with respect to inclusion relation which has no 
maximal element. If we take the set Z of infinite subsets of Y, then it has no minimal 
elements. 


Let (X <) bea partially ordered set. An element a € X is called the largest (least) 
element of X ifx < a(a < x) for all x € X. If x; and x2 are largest (least) elements of 
X,X1 < x2 and x2 < x,. By the antisymmetry of <, x; = x2. Thus, there is a unique 
largest (least) element in a partially ordered set provided it exists. 

It may be observed that a largest (least) element is also a maximal(minimal) but a 
maximal (minimal) element need not be the largest (least). In Example 2.5.2, b and 
d (a and c) are maximal (minimal) but none of them are largest (least). It may also 
be noticed that largest (least) need not exist(see Example 2.5.6). 

Let (X <) be a partially ordered set and A C X. Let U(A)(L(A)) denote the 
set of all upper (lower) bounds of A (note that U(A)(L(A)) may be empty sets also). 
Then, < induces a partial order on U(A)(L(A)). Note that all elements of A are lower 
(upper) bounds of U(A)(L(A)). Thus, A C L(U(A))\(A CC U(L(A))). 
The least (largest) element of U/(A)(L(A)) (if exists) is called the 
least upper bound(greatest lower bound) of A. The least upper bound (greatest 
lower bound) of A is denoted by ].u.b(A)(g.1.b(A)) or supA(infA). If A has the 
largest (least) element, then that is the l.u.b(g.1.b) of A. 


Remark 2.5.9 Least upper bound (greatest lower bound) need not exist even if A has 
upper (lower) bounds: Let Y = {a, b,c, d}andX = g(Y) —{{a, b}}. Then, C defines 
a partial order on X. Take A = {{a}, {b}}. Then, U(A) = {{a, b,c}, {a, b, d}, Y}. 
Clearly, U(A) has no least element. Thus, A has no 1.u.b. 
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Theorem 2.5.10 Let(X <) be apartially ordered set. Then, the following conditions 
are equivalent. 

(1) Every nonempty subset of X which has an upper bound has least upper bound in 
Xx. 

(2) Every nonempty subset of X which has a lower bound has greatest lower bound. 


Proof Assume 1. Let A be a nonempty subset of X which has a lower bound. Then, 
L(A) 4 %. Clearly, @ 4 A C U(L(A)). Hence, L(A) has an upper bound. By 1, L(A) 
has the least upper bound a (say). Since a is the least element of U(L(A)) (by the 
definition of l.u.b) and A C U(L(A)), a < x forall x € A. Thus, a € L(A). Further, if 
y € L(A), then y < x for all x € U(L(A)). In particular, y < a. Thus, a is the largest 
element of L(A). This shows that a is the g.1.bA. 

The proof of 2 ==> | is similar. ft 


A partial order < on X is called a complete order if it satisfies any one (and hence 
both) of the equivalent conditions of the above theorem. 

Let (X, <) be a partially ordered set. A subset Y of X is called an initial segment 
of X if y € Y andx < y implies that x € Y. Thus, X itself is an initial segment of 
(X, <). For each x € X, the subseto, = {y € Y | y < x} is an initial segment 
of X associated with the element x € X. The map ao from X to s9(X) defined by 
a(x) = o, 18 an injective map from X to 9 (X) which is order preserving in the sense 
that “x < y => o, Co,,.’ Again, for each x € X, the subset 7, = {y €X | y < x} 
is also an initial segment. This initial segment is called the strict initial segment 
associated with x. 

A partial order < on X is called a well-order if every nonempty subset of X has the 
least element. A pair (X, <), where < is a well-order, is called a well-ordered set. 
Every well-order is a total order: Let < be a well-order on X. Let x, y € X. Then, 
{x, y} is a nonempty subset of X. Since < is a well-order, {x, y} has a least element. 
If x is the least element, then x < y; if y is the least element, then y < x. This proves 
that every well-order is a total order. Indeed, a well-order is a complete order. For, 
suppose that < is a well-order on X. Let A be a nonempty subset of X which has an 
upper bound. Then, the set U(A) of upper bounds of A is a nonempty subset of X. 
Since < is a well-order on X, U(X) has the least element a (say). Evidently, a is the 
least upper bound of A. A complete order need not be a well-order. For example, the 
inclusion relation on the power set 99 (X) of X = {a,b,c} is a complete order, but 
it is not a well-order. 


Proposition 2.5.11 Let (X, <) be a well-ordered set. Then, a proper subset Y of X 
is an initial segment if and only if it is strict initial segment n, for some x € X. It 
need not be ox for any x € X. 


Proof Let Y be a proper subset of X. Then, X — Y 4 WY. Since (X, <x) is a well- 
ordered set, X — Y has least element x (say). Clearly, 7, C Y. Since Y is an initial 
segment, x hy y for any y € Y. This shows that Y C n,. Thus, Y = 7,. Note that 
the successor N* of N is a well-ordered set with usual inclusion ordering, and N is a 
proper subset of Nt which is an initial segment, but it is not 0, foranyx Ee Nt. # 
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Finally, we state and prove the two important equivalents of axiom of choice which 
are commonly used in mathematics. 

Zorn’s Lemma: Let (X, <) be a nonempty partially ordered set in which every 
chain has an upper bound. Then, (X, <) has a maximal element. 

Well-ordering principle: On every set, there is a well-order. 


Theorem 2.5.12 The following are equivalent: 


(1) Axiom of choice. 
(2) Zorn’s lemma. 
(3) Well-ordering principle. 


Proof The following is the scheme of the proof. We shall prove that 2 => 3, 3 => 
1, and then 1 => 2. 

(2 => 3). Assume 2. Let X be a set. We have to show the existence of a well- 
order on X. If X = 9, then there is nothing to do. Assume that X is a nonempty set. 
Consider the set & given by 


x = {(Y, <y) |< isa well-order on Y, where Y C X}. 


If x € X, then there is the unique partial order <;,; on {x} which is a well-order. 
Thus, ({x}, <,) € &. Hence, & is nonempty set. We say that (Y, <y) < (Z, <z) if 
(Y,<y) = (Z,<z) orelse Y C Z, <7 /Y =<y,andY = 7, for some z € Z. 
Clearly, (2, <) is a nonempty partially ordered set. Let Q = {(Yq, <y,) | a € A} 
be a chain in (X <). Take Yo = sex Y,. Then, there is a unique order <y, 
on Yo whose restriction to each Y, is <y,. If A is a nonempty subset of Yo, then 
Af) Ya. 4 @ for some ag € A. If (Ya, <y,) < (Yoo: <y,,) for all a € A, then 
Yo = Yq, and so A has the least element. If not, then there is an element a € A 
such that (Y,,, <y,,) < (Yq, <y,). Hence, there is an element x € Yo such that 
Yoo = Nx 

Let a be the least element of A ()} Yq, Let b be any element of A. Then, b is not 
strictly less that a, for then b will be a member of A{ Y,,,. Hence, a <y, b. Thus, 
a is the least element of A. It follows that (Yo, <y,) is a well-ordered set, and it is 
an upper bound of (2. This shows that every chain in (X, <) has an upper bound. 
By the Zorn’s lemma, there is a maximal element (M, <y) of (2, <). We show 
that M@ = X. Suppose not. Then, there is an element x) € X — M. Consider the 
set L = M \){xo}. Extend the well-order <y on M to the well-order <, on L by 
defining x <; Xo for all x € M. Clearly, (L, <~) € &, and it is larger than (M, <,). 
This is a contradiction to the maximality of (MW, <,). Thus, M = X, and <y isa 
well-order on X. This completes the proof of 2 => 3. 

(3 => 1). Assume 3. Let {X, | a € A} be a nonempty family of nonempty sets. 
By the well-ordering principle, there is a well-order <, on X, for each a. For each 
a € A, let c(a) denote the least element of X,. This gives us a map c from A to 
cm A Xa Such that c(a@) € X,. This completes the proof of 3 => 1. 

(1 => 2). Assume 1. Let (X, <) be a nonempty partially ordered set in which 
every chain has an upper bound. We need to show the existence of a maximal element. 
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Recall the map o from X to 9 (X) given by o(x) = o,, whereo, = {ye X | y < x} 
is the initial segment associated with x. Clearly, o is an injective map which is order 
preserving in the sense that ‘x < y if and only if a(x) C a(y).’ Consider Y = a(X). 
Then, (X, <) is order isomorphic to (Y, C). It is sufficient, therefore, to show that 
(Y, ©) has a maximal element. Let C(X) denote the set of all chains in X. Then, 
(C(X), C) is also a partially ordered set. Further, since every chain in (X, <) has 
an upper bound, every member of C(X) is contained in a member of Y. Thus, Y is 
co-final in (C(X), C). It follows that the maximal members of (Y, C) are same as 
those of (C(X), C). It is sufficient, therefore, to show that (C(X), C) has a maximal 
element. 
Now, C(X) satisfies the following two properties: 


(i) If A € C(X), then all subsets of A also belong to C(X). In particular, @ € C(X). 
(ii) If T is a chain in (C(X), C), then WaanA € C(X). 


By the axiom of choice, we have a map c from go (X) — {@} to X such that c(A) € A 
for all A € 99 (X) — {0}. For each A € C(X), consider the setA = {xe X |AULx} € 
C(X)}. To say that A is maximal in (C(X), C) is to say thatA = A. Define a map 
x from C(X) to X by x(A) = AifA — A = G,andy(A) = AUf{c(A — A)} if 
Aaa + %. We need to show that there is an element A € C(X) such that x(A) = A. 

Let us call a subset & of C(X) to be a tower in C(X) if the following 3 conditions 
hold. 


(i) Pex. 
(ii) x(A) € & forall A € &. 
(iii) If T is a chain in (, C), then User A eX. 


Clearly, C(X) is a tower, and the intersection of a family of towers is a tower. Let Xo 
denote the smallest tower in C(X). Indeed, it is the intersection of all towers in C(X). 
It is sufficient to show that Xo is achain in (C(X), C). For, then B = Uses, A€é Xo, 
and so x(B) € Xp. Since x(B) C B, it follows that x(B) = B. 

Now, we show that Xo is a chain in C(X). More explicitly, we need to show that 
for any pair A, B € Xo, A C B, or B CA. Let 


YT = {Ae Xo | forall Be Xo, AC Bor BCA}. 
Clearly, @ € l’. Let A € TI’. Consider 
Ty = {Be Xo|BCAor y(A) CB}. 


We show that I’, is tower. Clearly, @ € Ty. Let B € Ty. Then, BC A or x(A) CB. 
Suppose that B C A. If B = A, then y(A) = y(B), and so in this case, yx(B) € T'4. 
Suppose that B Cc A. Then, y(B) C A. For, if not, then, since A €¢ T, A C y(B). 
This is not true, for y(B) contains at the most one more element than B. Thus, in 
this case also, y(B) € T',. Finally, if y(A) C B, then y(A) C x(B). In this case also, 
x(B) ely. 
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Let {B, | a € A} be achain in I’,. Then, from the definition of I',, either each 
B,, is contained in A or x(A) is contained in some B,. This shows that J BCA 
ot ¥(A) © Uncen Ba. Hence, 24 Ba € Ty. 

This completes the proof of the fact that I’, is a tower contained in Xo. Since Xo 
is the smallest tower, it follows that (4 = Xo. 

Finally, we prove that" = Yo. Indeed, again, we prove that I" is a tower. Clearly, 
®€T.LetA € I. Consider y(A). Let B € Xo. Then, from what we have proved 
above, I, = po, and so B € Ty. Hence, B CA C y(A), or y(A) C B. This 
shows that x(A) € I’. Let {A, | a € A} bea chain inl’. Let B € Xo. Then, either 


aeA 


each A, is contained in B, or B C A, for some a. This means that Wen Ag © B, or 
BC Use Aa: This means that L),., Aa € FP. Hence, Fis a tower. Inturn, PF = Xo. 
Hence, Xo is a chain. tt 


Exercises 


2.5.1 Let (X, <) bea partially ordered set. Let A C B. Show that U(A) > U(B) and 
L(A) > L(B). 


2.5.2 Show that U(A) = U(L(U(A))) and L(A) = L(U(L(A))). 
2.5.3 Show that g.l.b need not exist. 


2.5.4 Let A C B. Show that 


(i) g.LbB <g.l.bA 
(ii) 1u.bA <1u.bB. 


2.5.5 Show by means of an example that l.u.bA need not belong to A. 
2.5.6 Show that (P(X), C) is order complete. 
2.5.7 Give an example of a partially ordered set which is not complete. 


2.5.8 A partially ordered set (L, <) is called a lattice if any pair of points a, b has 
the least upper bound denoted by a \/ b as well as the greatest lower bound a /\ b. 
Show that (P(X), C) is a lattice. 


2.5.9 Let (X, <x) and (Y, <y) be well-ordered sets. Show that X x Y with dictionary 
order is a well-ordered set. 


2.5.10 Let f be a surjective map from X to Y. Use axiom of choice to show the 
existence of an injective map g from Y to X such that fog is the identity map on Y. 


2.6 Ordinal Numbers 


Definition 2.6.1 A well-ordered set (a, <) is called an ordinal number if for each 
x € a, the strict initial segment 7, = {a € a|a < x} is same as x. 
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There is a unique well-order 6 = @¢ x ¢ on the set ¢. Clearly, the statement 
Vx €b,ny = {a€a|a<x} = x’ is vacuously satisfied. Thus, the J together 
with this ordering is an ordinal. This ordinal number is denoted by 0. However, if 
(a, <) is an ordinal number, where a 4 @, then % € a, and indeed, ¢ is the least 
element of qa. For, by the definition of an ordinal number, if x is the least element of 
a, then? = {aeala<x} =m = x. 


Example 2.6.2 Consider the set 1 = {@}. There is only one well-order < on 1. The 
strict initial segment ny = {x | x < B} = J. It follows that (1, <) is an ordinal 
number. The set 2 = {¢, {d}} with the inclusion ordering is clearly an ordinal 
number. Indeed, all natural numbers are ordinals. 


Example 2.6.3 Consider the set N of natural numbers together with inclusion order- 
ing. Let A be a nonempty subset of N. If @ € A, then ¢ is the least element of A. Let 
x € A. Then, since x € N is an ordinal, 7, = x is a well-ordered set. Clearly, the 
least element of A (.} 7, is the least element of A. It follows that N together with the 
usual inclusion ordering is a well-ordered set. By the definition, 7, = x. Thus, N 
with usual ordering is an ordinal. This ordinal will be denoted by w. 


Example 2.6.4 Consider the set the successor N* of the set N of natural numbers. 
We extend the well-ordering of N to the ordering <j+ on Nt by defining n <j+ N for 
alln € N. Clearly, (N*, <j+) is a well-order. Further, it is also an ordinal number, 
for the strict initial segment ny = {n € Nt | n <y+ N} = N. This ordinal 
is the continuation of w, and it is denoted by w + 1. Similarly, we have the ordinal 
number w + 2, and so on. The axiom of replacement ensures the existence of the 
set {w +n | n € w} of ordinal numbers such thatw +n* = (w +n). This 
is a well-ordered set of ordinal numbers with w the least ordinal number. Indeed, 
w-+n* is the continuation of w +n. There is a unique well-order <.2) on the union 
w2 = Un,e,,(w +n) subject to the condition that their restriction to each w + n 
is the order <,,,, of the ordinal number w + n. This process continues to generate 
different ordinal numbers. 


Definition 2.6.5 Two partially ordered sets (X, <x), and (Y, <y) are said to be 
order isomorphic (also called similar) if there is a bijective map f from X to Y such 
that a <x b implies that f(a) <y f(b). 


Proposition 2.6.6 Let f be an order isomorphism from a partially ordered set 
(X, <x) to a partially ordered set (Y, <y). Then 


(i) a <x bimplies that f (a) <y f(b), 
(ii) f—! is an order isomorphism from (Y, <y) to (X, <x), and 
(iii) the relation of being ‘order isomorphic to’ is an equivalence relation on any 
set of partially ordered sets. 


Proof (i) Suppose that a <x b. Then, by the definition, f(a) <y f(b). Suppose that 
f(@ = fd). Since f is bijective, a = b. This is a contradiction to the supposition 
that a <y b. 
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(ii) Suppose that f is an order isomorphism. Suppose that c <y d, wherec,d € Y. 
Suppose thata = ff °C, andb = 7 Then, f(a) = candf(b) = d.If 
b <x a, then from (i), d = f(b) <y f(a) = c. This is a contradiction. Hence, 
f(O <x f-'(d). 

(iii) Since Jy is an order isomorphism from (X, <x) to itself, the relation is reflex- 
ive. If f is an order isomorphism from (X, <x) to (Y, <y), then, from (ii), it follows 
that | as is an order isomorphism from (Y, <y) to (X, <x). This shows that the 
relation is symmetric. Since the composition of two-order isomorphism is an order 
isomorphism, it follows that the relation is transitive. 


Proposition 2.6.7 A well-ordered set (X, <x) may be order isomorphic to a proper 
subset Y with induced well-ordering. If f is an injective order preserving map from 
a well-ordered subset (X, <x) to itself, thena < f(a) forallae X. 


Proof N is a well-ordered set with usual ordering, and the successor map s from N 
to its proper subset N — {} is an order isomorphism. Let f be an injective order 
preserving map from a well-ordered subset (X, <x) to itself. Let A = {x € X | 
f() <x x}. Suppose that A 4 @. Since (X, <y) is a well-ordered set, A has the least 
element b (say). Then, f(b) <x b. From the above proposition, f(f(b)) <x f(b). 
This means that f(b) € A. This is a contradiction. Hence, A = @, andsoa <x f(a) 
forallae X. tt 


Corollary 2.6.8 Let (X, <x) and (Y, <y) be two well-ordered sets which are order 
isomorphic. Then, there is a unique order isomorphism from X to Y. 


Proof Letf and g be two-order isomorphisms from X to Y. Then, g~!of is an order 
isomorphism from X to itself. From the previous proposition, a <x g~!(f(a)) for all 
a € X. This means that g(a) <x f(a) for all a € X. Similarly, considering the order 
isomorphism f~!og, we conclude that f(a) <x g(a) for all a € X. This shows that 


f= 49. t 


Corollary 2.6.9 A well-ordered set can not be order isomorphic to any of its strict 
initial segment. 


Proof Let (X, <x) bea well-ordered set. Letx € X. Consider the strict initial segment 
n,. Let f be a map from X to 7,. Then, f(x) <x x. From the Proposition 2.6.7, it 
follows that f can not be an order isomorphism. ft 


Corollary 2.6.10 The only order isomorphism from a well-ordered set (X, <x) to 
itself is the identity map. tt 


Corollary 2.6.11 Let (X, <x) and (Y, <y) be well-ordered sets. Then, one and only 
one of the following hold: 


(i) (X, <x) is order isomorphic to a strict initial segment of (Y, <y). 
(ii) (Y, <y) is order isomorphic to a strict initial segment of (X, <x). 
(iii) (X, <x) is order isomorphic to (Y, <y). 
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Proof From the Corollary 2.6.9, it follows that at the most one of the above condition 
can hold. We need to prove that at least one of the three conditions hold. Let (X, <x) 
and (Y, <y) be well-ordered sets. Further, from the Corollary 2.6.9, it again follows 
that in any well-ordered set the strict initial segment associated with a is order 
isomorphic to an strict initial segment associated with b if and only if a = b. Let 


x = {x € X | 7, is order isomorphic to ny for some, y € Y}. 


Leta € X,andx < x a,x € X. Then, there is a unique element b € Y such that 7, is 
order isomorphic to 7. Let f be the unique order isomorphism from 7, to 7. Then, 
f(x) € m, and the restriction of f to 7, is an order isomorphism from 7, to nx). 
Hence, x € &. This ensures that & is an initial (not necessarily proper) segment of X. 
We have the map x from © to Y given by the condition that 7, is order isomorphic to 
"y(x)- Clearly, xy is an injective order preserving map. Observe that the image y(2) 
is also an initial segment of Y. If & = X, then X will either be order isomorphic 
to Y or it is order isomorphic to a proper initial segment of Y. Suppose that © # X. 
Then, & is a proper segment, and hence, there is an element x € X — & such that 
j = &. Suppose that x(x) # Y. Then (2%) is a proper segment of Y. Hence, 
there exists an element y € Y such that x(2) = ny. But then 7), is order isomorphic 
to my, where x ¢ &. This is a contradiction to the choice of &. Hence, y(X) = Y. 
This means that Y is order isomorphic to the initial segment & of X. tt 


Proposition 2.6.12 Let (a, <q) and (3, <,) be ordinal which are order isomorphic 
as well-ordered set. Then (a, <q) = (8, <s). 


Proof Let f be an order isomorphism from a to 3. We need to show that f(x) = x 
for all x € a. Consider y = {x € a | f(x) = x}. Suppose that y 4 a. Then, 
a—y 4. Since (a, <,) is a well-ordered set, a — y 4 @ has the least element a 
(say). Then, f(x) = x for all x € mq. Since f is an order isomorphism, and a and 3 
are ordinals,a = % = f(a) = Nya = f(a). This is a contradiction. Hence, 
7 =a. This shows that (a, <,) = (@, <g). tt 


Corollary 2.6.13 Every set of ordinal numbers is a total order. 
Proof Follows from Corollary 2.6.11 and the above proposition. tt 
Corollary 2.6.14 Every set of ordinal numbers is well-ordered. 


Proof Let be a set of ordinal numbers. Let & be a nonempty subset of Q. Let 
a € &. Ifa < £ for all G € &, then a is the least element of X&, and there is 
nothing to do. Suppose that there is a @ € & such that G < a. From the definition 
of ordinal number, 3 € a. This means that a) ¥ is non empty subset of a. Since 
a is a well-ordered set, it has the least element y (say). We show that + is the least 
element of X.Letd € U.Ifa < 6, theny < 6. Ifnot, thend < a,andsod € af). 
Since 7 is the least element of a (| X, 7 < 6. This shows that 7¥ is the least element 
of x. tt 
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The ordinals are of two types: Consider the ordinal w. For all n < w, there is 
an ordinal number m such that n < m < w. In other words, there is no immediate 
predecessor of w. Such ordinals are called the limit ordinals. All the natural numbers 
have immediate predecessors. These are not limit ordinals. The ordinal w2 is also a 
limit ordinal. 


Corollary 2.6.15 Let Q be a set of ordinal numbers. Then, Q is a order complete 
with respect to the ordering of ordinal numbers. 


Proof The result follows from the fact that every well-ordered set is order 
complete. ft 


Corollary 2.6.16 Let Q be aset of ordinal numbers. Then, there is an ordinal number 
a ¢ Q. In other words, there is no set containing all ordinal numbers. 


Proof Let Q be a set of ordinal numbers. Let G = U,<g a. Then, there is a unique 
order <g on § whose restriction to each a € Q is the order <, on a. Consider 
(3, <3). Leta € 2. Then, a € a for some a € Q. Hence, the strict initial segment 
Na is a itself. This shows that (3, <,) is an ordinal number which is an upper bound 
(indeed, L.u.b of Q) of Q. G may be a member of Q in case it is a limit ordinal. 
However, the successor (3+ of 3 is an ordinal number which does not belong to Q. tf 


Proposition 2.6.17 Let (X,<x) be a well-ordered set. Then, there is a unique 
ordinal (a, <q) which is order isomorphic to (X, <x). 


Proof The uniqueness part is evident from the Proposition 2.6.12. We show the 
existence of an ordinal which is order isomorphic to (X, <x). Let a be an element 
of X such that for each x € 7, there is, of course, unique ordinal a, which is order 
isomorphic to 7,. Clearly, the least element of X is such an element. It is also clear 
that 7, is order isomorphic to the ordinal (3, where /3 is the l.u.b of the set {a, | x € 7} 
of ordinals. This shows that if each strict initial segment of 7, is order isomorphic to 
an ordinal number, then 7, is also order isomorphic to an ordinal number. 
Let 


x = {a €X | Vx © %, 1, is order isomorphic to an ordinal number a,}. 


Clearly, the least element of X belongs to &. We first show that & = X. Suppose 
not. Then, X — X is a nonempty subset of X. Since (X, <x) is a well-ordered set, it 
has the least element a (say). Then, for all x <y a,x € X. This means that for all 
y € Nx, My 18 Order isomorphic to an ordinal a. From what we have already proved, 
it follows that 7, is also order isomorphic to an ordinal number a. We arrive at a 
contradiction that a ¢ &. Thus, & = X. Repeating again the previous arguments, 
we see that X is order isomorphic to an ordinal. tt 


Remark 2.6.18 The above proposition may prompt us to introduce an ordinal number 
as an equivalence class of well-ordered sets. But the equivalence classes are not sets. 
As such, one needs to select unique members from each equivalence classes. Indeed, 
this is what we have done in our approach. 
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Arithmetic of Ordinal Numbers 


Let (A, <,) and (B, <g) be two well-ordered sets. We have an order <4g on A X 
{0} U B x {1} defined as follows: (i) (a,0) <ap (a’,0) if and only if a <, a’, 
(ii) (b, 1) <p (0’, 1) if and only if b <p, b’, and (iti) (a, 0) <az (b, 1) forallae A 
and b € B. Clearly, <4, is a well-order. If (A, <,) is order isomorphic to (C, <c) and 
(B, <g) is order isomorphic to (D, <p), then itis clear that (A x {0} L) Bx {1}, <az) is 
order isomorphic (C x {0} L) Dx {1}, <cp). Thus, we can define, unambiguously, the 
sum a@ + £ of two ordinal numbers as follows: Suppose that a is order isomorphic to 
(A, <,) and ( is order isomorphic to (B, <g). Define a + ( to be the unique ordinal 
number which is order isomorphic to (A x {0} B x {1}, <ag). The following 
properties addition of ordinal numbers can be easily observed: 


Gi) a+0 = a = 0+<a, and 
Gi) (a+ 2+yeat G+) 
for all ordinals a, 3, and y. 


The addition of ordinal number is not commutative. Indeed, 1 + w = w is limit 
ordinal, where as w+ 1 = wt” is not limit ordinal. 

Next, suppose that (A, <4) and (B, <p) are two well-ordered sets. We have the 
lexicographic ordering <4,, on A x B defined as follows: (a,b) <axp (c,d) if 
b <g dor b =d anda <z, c. It can be checked that this order is a well-order. 
Further, if (A, <4) is order isomorphic to (C, <c) and (B, <g) is order isomorphic 
to (D, <p), then (A x B, <4) is order isomorphic to (C x D, <cxp). This prompts 
us to define the multiplication - on ordinals as follows: Suppose that the ordinal a is 
order isomorphic to the well-ordered set (A, <,) and the ordinal ( is order isomorphic 
to (B, <p). Define a - @ to be the unique ordinal which is order isomorphic to the 
well-ordered set (A x B, <4g). The following properties of - can be easily observed. 


Gi) a-0 = 0 = 0-a, 
Gi) a- 1 =a=1-a, 
(ii) (a 8)- = a- (6-7), and 
(iv) a- (G+) =a-Bt+a-y 
for all ordinal numbers a, /, and +. Note that the left distributivity of - over + 
need not hold. 


For further arithmetical properties of ordinals, refer to Naive set theory by Halmos, 
or to the set theory by Vipul Kakkar. 


2.7 Cardinal Numbers 


The abstraction of the counting process of finite sets leads to the concept of the 
ordinal numbers. On a finite set X, any two well-order structure is order isomorphic. 
Indeed, two finite well-ordered sets (X, <x) and (Y, <y) define the same ordinal 
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numbers if and only if their sizes are same in the sense that there is a bijective 
map from X to Y. However, this is not the situation in infinite case. Indeed, on the 
same infinite set X, we can have different well-order structures which define different 
ordinal numbers. For example, the usual well-order < on the set N of natural numbers 
determines the ordinal number w. We have another order <’ on N defined as follows: 
Ifm 4 1 4n,thenm <' nif and only if m < n. Also, m <' 1 for allm € N. 
Clearly, (N, <’) is a well-ordered set, and the initial segment 7 in (N, <’) is N— {1}. 
Note that (N, <) and (N, <’) are not order isomorphic. (N, <’) is order isomorphic 
to the ordinal w + 1. Similarly, we have another well-order on <” on N defined as 
follows: If m,n € N— {1, 2}, m <’ nif and only if m < n. Also, m <” 1 <" 2 forall 
m,n € N—{1, 2}. Then, (N, <”) is order isomorphic to w+ 2, and so on. Thus, there 
are infinitely many nonorder isomorphic well-order structures on N corresponding 
to different ordinals. Thus, ordinals are good for counting, but it does not distinguish 
the size of the infinite sets. This prompts us to look for an other concept, the concept 
of cardinals which measures the size of sets. 


Definition 2.7.1 Let X and Y be sets. We say that X dominates Y if there is an 
injective map from Y to X. X and Y are said to be equipotent or equinumerous if 
there is a bijective map from X to Y. We use the notation X ~ Y to say that X is 
equipotent to Y. 


Theorem 2.7.2 (Schroder-Bernstein Theorem) If X dominates Y and Y also domi- 
nates X, then X is equipotent to Y. More explicitly, if there is an injective map from 
X to Y, and also there is an injective map from Y to X, then there is a bijective map 
from X to Y. 


Proof Let f be an injective map from X to Y and g be an injective map from Y 
to X. We have to show that X and Y are equipotent. Put X — g(Y) = Z. Then, 
X = g(Y) UZ, where g(Y) and Z are disjoint. Since Y and g(Y) are equipotent, it 
is sufficient to show that g(Y) and X are equipotent. Let u € (gof)"(Z) (\(gof)*(Z), 
where r < s. Then, there existx, y € Z such thatu = (gof)'(x) = (gof)*(y). Since 
gof is injective,x = (gof)* "(y). This means that x € Z (] g(Y). This is impossible. 
It follows that (gof)’(Z) (\(gof)*(Z) = Oforallr As. PutU = U,en(gof)’(Z), 
and V = Uren (gef)’ (Z). Then, U = (gof)(Z)\ V. From what we have 
observed, it follows that (gof)(Z) and V are disjoint. Also, U and V are equipotent. 
Indeed, gof is a bijective map from U to V. Also Z and (gof)(Z) are equipotent. 
Since U = (gof)(Z) U V,Z U U is equipotent to U(note that Z and U are disjoint). 
Now, put g(Y) — U = W. Then, g(Y) = UUW. Hence, X = UUWLUZ. 
Since U |) Z is equipotent to U, U LJ W is equipotent to X. This shows that g(Y) is 
equipotent to X. ft 


Definition 2.7.3. An ordinal number a is said to be a cardinal number if whenever 
an ordinal number (3 is equipotent to a, a < (3. 


Thus, all natural numbers are cardinal numbers. w is a cardinal number, whereas w+ 1 
is not a cardinal number. The ordinal number w considered as a cardinal number is 
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denoted by No. Indeed, an infinite cardinal number is a limit ordinal. It follows from 
the properties of ordinal numbers that a set of cardinal numbers is totally ordered. 

Let X be a set. From the well-ordering principle, there is a well-order on X. The 
set of all ordinal numbers which are order isomorphic to the different well-order 
structures on X has the least element. This least element is clearly a cardinal number, 
and it is called the cardinal number of X. The cardinal number of X is denoted by 
| X |. Evidently, | X | =| Y | if and only if X is equipotent to Y. Further, ifa = | X | 
andb =| Y | are two cardinal numbers, then a < bif and only if there is an injective 
map from X to Y. 


Definition 2.7.4 A set X is said to be a countable set if | X | is a natural number, or 
it is No. It is said to be an infinite countable set if | X |= No. Thus, X is countably 
infinite if and only if there is a bijective map from X to N. A set X is said to be 
uncountable if it is not countable. 


Since there is no surjective map from N to the power set so (N), so (N) is uncount- 
able. Observe that go (N) and 2% are equipotent, and so | g (N) | =| 2 |. The cardinal 
number | 2% | is denoted by &;. The cardinal number | 22" | is denoted by 2, and so 
on. If 8 is an cardinal number, then the cardinal number | 2* | is denoted by 2°. If A is 
equipotent to B, and C is equipotent to D, then A is equipotent to B?. Thus, we can, 
unambiguously, define the power a’ as follows: Suppose thata =| A |andb =| B|. 
Define a’ = | AB |. In turn, for each cardinal number &, we have the cardinal number 
2®, and we have a chain of infinite cardinal numbers Xo, 81, 82, ---, Nas Natis s+ 
where Xy4) = 2o of infinite cardinal numbers, where a runs over a chain of ordinal 
numbers. 

Continuum hypothesis. The continuum hypothesis (CH) asserts that there is no 
cardinal number in between %) =| N | and &; =| 2N |. More precisely, it asserts 
that if there is a set A such that there is an injective map from N to A, and there is an 
injective map from A to 2", then A is equipotent to N or it is equipotent to 2, 

K.Godel in 1939 proved that if the ZF axiomatic system is consistent, then adjunc- 
tion of CH in ZF does not lead to any contradiction. In other words, CH is consistent 
with the ZF axiomatic system. Further, in 1963, P. Cohen proved that ZF axiomatic 
system does not lead to a proof of CH. Consequently, CH is independent of the ZF 
axiomatic system. 

Generalized continuum hypothesis. The generalized continuum hypothesis 
(GCH) asserts that for each ordinal a, there is no cardinal number between &, and 
Nott = 2%. The topologist Sierpinski proved that GCH implies axiom of choice. 
K.Godel also showed that GCH is consistent with the ZF axiomatic system. 


Arithmetic of Cardinal Numbers 


Let (A, C) and (B, D) be pairs of equipotent sets. Suppose that A(.)B = @ = 
C ()D. It is evident that A L) C is equipotent to B J D. Thus, we have the addition + 
on a suitable set Q of cardinal numbers defined bya+b =| (A x {0}) U(B x {1}) |, 
where a =| A |andb =| B |. The following properties of + can be verified easily. 


2.7 Cardinal Numbers 51 


G) a+0 =a = O+a, 

(ii) (at+b)+c = a+(b+0o), 

(iii) a+b = b+a, and 

(iii) a < bandc < d implies thata+c<b+d 
for all a,b, c,d € Q. 


We can also define sum of an arbitrary family of cardinal numbers as follows: Let 
{dq =| Aa | | a € A} be a family of cardinal numbers. We define 


Drekay =| JAgx tel 


aeA 


Recall that a set X is said to be a finite set if every injective map from X to itself 
is a surjective map. It is easily observed that a set X is finite if and only if | X | isa 
natural number. 


Proposition 2.7.5 A set X is infinite if and only if there is an injective map from the 
set N of natural numbers to X. 


To prove this result, we need the following recursion theorem whose proof can 
be found in the next chapter. 

Recursion Theorem. Let X be a set and a € X. Let f be a map from X to X. 
Then, there is a unique map g from N to X such that g(1) = a and g(n*) = f(g(n)) 
for alln € N. 

Proof of the proposition 2.7.5: Let X be an infinite set. Let f be an injective map 
from X to X which is not surjective. Let a € X which is not in the image of f. By the 
recursion theorem, there is a unique map g from N to X with g(1) = aand is such 
that g(n*) = f(g(a)). Let 


M = {méEN|g(m) = g(n) implies thatm = n}. 


Since a is not in the image of f, 1 € M. Suppose that m € M. Then, g(m) = g(n) 
implies that m = n. Suppose that g(m*) = g(n). Then, n 4 1. Hence, there is an 
element r € N such that n = rt. By the definition of g, f(g(m)) = f(g(r)). Since f 
is injective, g(m) = g(r). This means that m = r, and som* = n. It follows that 
mt € M.By Ps,M = N. This shows that g is injective. t 


Proposition 2.7.6 So + Xo = Xo. 


Proof Itis sufficient to give a bijective map from N x {0} J Nx {1} to N. Let X denote 
the set {2n | n € N} of even natural numbers and Y denote the set {2n + 1 | n € N} 
of odd natural numbers. Then, X and Y are disjoint. Further, n ~» 2n is a bijective 
map from N to X, and n ~ 2n + | is a bijective map from N to Y. This shows that 
N x {0} UN x {1} is equipotent to N. The result follows. tt 


Corollary 2.7.7 For every natural number n, %) + n = Xo. 
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Proof (N x {0}Un x {1}) c (N x {0} UN x {1}). From the above proposition, 
it follows that (N x {0}UJn x {1}) is equipotent to a subset of N. Also, the map 
n ~~» (n,Q) is an injective map from N to (N x {0}Ua x {1}). By the Schréder— 
Berstein theorem, N is equipotent to (N x {0} Un x {1}). The result follows. t 


Proposition 2.7.8 If a is an infinite cardinal, thena+a = a. 


Proof Let us suppose that a =| A |, where A is an infinite set. We need to show that 
A x {0} UA x {1} is equipotent to A. Let 


x = {(X,f) |X CA, f is a bijective map from X to X x {0} Lx x {1}}. 


Since A is infinite, by the Proposition 2.7.5, there is a subset X of A which is equipotent 
to N. From the Proposition 2.7.6, X and X x {0} ) X x {1} are equipotent. Hence, 
x 4 WY. Define a partial order < on & by putting (X,f) < (Y,g) if X © Y and 
g/X = f.Clearly, (2, <) isanonempty partially ordered set. Let {(Xq, fa) | a € A} 
be achain in (X, <). LetX = Lee A Xo and f be the map whose restriction to each 
X, is f,. Itis an easy observation that f is a bijective map from X to X x {0} L) X x {1}. 
This shows that (X, f) is an upper bound of the chain. By the Zorn’s lemma, (X, <) 
has a maximal element (Xo, fo) (say). Now, we show that A — Xo is a finite set. 
Suppose not. Again, by the Proposition 2.7.5, there is a subset Z of A — Xo which is 
equipotent to N. But, then there is a bijective map h from Z to Z x {0} UZ x {1}. 
Take U = Xo Z, and the map ¢ from U to U x {0} LU U x {1} whose restriction to 
Xo is fo, and whose restriction to Z is h. Clearly, (U, ¢) € &. This is a contradiction 
to the maximality of (Xo, fo). Thus, A — Xo is finite. From the Corollary 2.7.7, 


a =|A|=|Xo|=| Xo x (0) Xo x (I) =| X01 + |Xol= a + a. 


t 


Now, we define the product - of two cardinal numbers as follows: First observe that 
| A |= |C|and|B|=| D | imply that | A x B | =| C x D |. Thus, we can, 
unambiguously, define the product a - b of two cardinal numbers a = | A | and 
b =|B\|bya-b =|A xB |. The following properties of the multiplication - can 
be easily observed: 


(i) a-0 = 0 = 0-a, 
(ii) (a-b)-c = a-(b-c), 
(iil) a-b = b-a,and 
(iv) a-(b+c) =a-b+a-c 
for all ordinal numbers a, b, and c. 


The proof of the following proposition uses Zorn’s lemma, and it is similar to the 
proof of the Proposition 2.7.8. 
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Proposition 2.7.9 Ifa is an infinite cardinal number, thena:a = a. tt 


As a corollary, we obtain the following: 


Proposition 2.7.10 [fa <b, thena-b = b. tt 


Gédel-Bernays Axiomatic system 


In 1920, John von Neumann attempted an other axiomatic system for set theory. His 
axiomatic system significantly differed from the ZF axiomatic system. Indeed, for 
him, the primitive term (concept) was that of a correspondence (a map) instead of a 
set. Later, Gddel and Bernays modified it to make it more appealing and near to ZF 
system. For them, the primitive term is class instead of set. A member of a class in 
this axiomatic system is a set. Most of the axioms of the Gédel—Bernays system is 
same as those of ZF axiomatic system with set replaced by class except the axiom of 
replacement. Further, in this axiomatic system, a set may be a class, but then it does 
contain all sets or all ordinal numbers. Sets are those classes which are adequate to 
develop mathematics. The Gédel—Bernays axiomatic system is most suitable for the 
categorical discussions. 


Chapter 3 
Number System 


3.1 Natural Numbers 


What is one? One pen, one man. These all reflect the idea of being single which is 
the common property of all singletons. One may be tempted to represent ‘one’ by 
all singletons. But, if there is a set X containing all singletons, then {X} € X and 
also X € {X}. This is a contradiction to the axiom of regularity in set theory. Thus, 
instead of looking at all singletons, we choose a canonical representative {J} of the 
class of all singletons to define one. Similarly, the representative {, {G}} = {0}? of 
all doubletons is chosen to represent two and so on. It has been seen in the previous 
chapter that successor sets contain all these and the axiom of infinity ensures the 
existence of a successor set. We have also seen in the previous chapter that there is 
a unique successor set contained in all successor sets. 

Recall that the set N of natural numbers is the smallest successor set, and in turn, 
it satisfies the following properties termed as Peano’s axioms. 


P\. 1eEN. 

P>. For allx e N, xt EN. 

P3.xt = y* if and only if x= y. 

Py. For allx EN, 1 A x*. 

Ps. If M is a set such that 1 € M and xt € M for allx € M()\N, thenN C M. 


The properties P;, P2 and P3 follow from the fact that N is a successor set. To 
prove P4, suppose that 1= xt for some x € N. Clearly, x 4 1, for 1* 4 1 (Exercise 
2.1.19). Thus, 1 € N — {x}. Suppose that y e N — {x}. If yt = x then (yt)? = 
xt = 1 = {G} = @~. But, then yt = @. This means that y € @ (a contradiction). 
Hence, y* € N — {x}. This shows that N — {x} is a successor set, a contradiction to 
the fact that N is the smallest successor set. This proves P4. Under the hypothesis of 
Ps, M (| N becomes a successor set. Since N is the smallest successor set, M CN. 
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The property Ps is called the principle of induction. A proof using Ps is called 
a proof by induction. 

Now, instead of looking N as the smallest successor set, we shall be using only 
the properties P; — Ps which is sufficient for the further course of developments. We 
shall call x* the successor of x. 


Proposition 3.1.1 xt Ax for allx € N. 


Proof Let M = {x € N | xt # x}. Since 1+ 4 1(Py), 1 € M. Let y € M. Then 
yt # y. Now (yt)* # y* for, otherwise by P3, yt = y. Thus, yt € M. By 
P5,NCM CN. HenceM=N. tt 


Proposition 3.1.2 / is the only element in N which is not successor of any element 
inN. 


Proof From P4, 1 is not successor of any element in N. If a £ 1 is not successor of 
any element in N, then by P5, N C N — {a}. This is a contradiction. tt 


Recall that a set X is called finite if every injective map from X to X is surjective 
(equivalently every surjective map from X to X is injective (Exercise 2.4.10)). A set 
which is not finite is called an infinite set. 


Proposition 3.1.3 N is an infinite set. 


Proof The map s : N —> N defined by s(x) = x* is an injective map (P3). It is not 
surjective for 1 is not successor of any element in N. tt 


The following theorem gives sound and rigorous footing for the definitions by 
induction. 


Theorem 3.1.4 (Recursion Theorem) Let X be a set anda € X. Let f be a map 
from X to X. Then, there is a unique map g from N to X such that g(1) = a and 
g(n*) = f(g(n)) for alln EN. 


Proof We first show the uniqueness.' Let g and h be maps from N to X such that 
gd) = a = AC) and g(n*) = f(g(n)), A(n*) = f(A(~)) for all n € N. Let 
M = {née N| gn) = h(n}. Since g(1) = h(1) = a, 1 € M. Suppose that n € M. 
Then, g(n*) = f(g(n)) = f(A(n)) = h(n*). Hence, n* € M. By Ps, M = N and so 
g(n) = h(n) for all n € N. This shows that g = h. 


Now, we show the existence. Let 
A={hCNxX|(1,a) € hand whenever (n, x) €h, (nt, f (x)) Eh}. 


Clearly, N x X € AandsoA # G. Let g = (),<,4h. Then, g is also a member of A. 
Hence, it is sufficient to show that gisamap. Let M = {n € N | there isa uniquex € 
X such that (n,x) € g}. Now (1, a) € g. Suppose that (1, b) € g, where a 4 b. 


'The reader may skip the proof. 
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Clearly, g — {(1, b)} € A. This shows that 1 € M. Suppose that n € M. Then, there 

is a unique x € X such that (n, x) € g. Since g € A, (n*, f(x)) € g. Suppose that 

(nt, b) € g, where b 4 f(x). Then, g — {(n*, b)} € A (verify). This shows that 

nt € M. By Ps, N= M. Hence, g is a map with the required property. tt 
Any use of recursion theorem is called a definition by induction. 


Definition 3.1.5 Let X beaset. Amapo : X xX —> X iscalledabinary operation 
in X. The image of (a, b) under o is denoted by aob. 


We define binary operations + and - in N as follows: 

Let s be the successor map from N to N given by s(n) = n*. Let m € N. Then, by 
the recursion theorem, there is a unique map fj, from N to N such that f,(1) = mt 
and f,,(n*) = s(fn(2)) = fin(n)*. Now, define a binary operation + in N by 


m+n = fn(n). 


Then, evidently 


G) m+ 1 = fx(1) = mt. 
Gi) m + nt = fn) = (fnln)yt = (m + ny*. 


Next, consider the map s” from N to N defined by s(n) = n-+=m. Again, by 
the recursion theorem, there is a unique map f” from N to N such that f"(1) = m 
and f”(nt) = s(f"™(n)) = f'"(n) +m. Define a binary operation - in N by 


m:n = f™(n). 


Evidently, 


G) m-1 =m. 
Gi) m-nt = f™nt) = s™Ff™n)) = m-n +m. 


Theorem 3.1.6 The triple (N, +, -) has the following properties. 


(i) + is associative in the sense that (n+m)+r=n+(m+r) for alln,m,r &€N. 
(ii) + is commutative in the sense thatm+n = n+mforallm,néN. 
(iii) l-n =n =n -1forallneN. 


(iv) - distributes over + from left as well as right in the sense that 


m:-(n+r) =m-n+m-r, 


and 
(m+n)-r=m-r+n-r 
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forallm,n,r €N. 


(v) - is associative in the sense that (m-n)-r = m-(n-r) for allm,n,r €N. 
(vi) - is commutative in the sense thatm:n = n-mforallm,néN. 
(vii) n+m An for allm,neN. 


(viii) Cancellation law holds for + in the sense thatm +n = m-+rimplies thatn = 
randm+n = r+nimplies that m= r. 


(ix) Cancellation law holds for - in the sense thatm +n = m-r implies thatn =r 
andn-m = r-mimplies thatn =r for all m,n,r €N. 


Proof We prove (ii), (iv), (vii), and (ix). The rest can be proved similarly and is left 
as an exercise. 


(ii) Let M = {n € N| 14+n=n-+1}. Clearly, 1 € M. Suppose that n € M. Then, 
l+nt =(1+n)* (bydef) = (n+ 1)* (orn eM) = (n*)* (bydef) = nt +1. 
Thus, nt ¢ M. By Ps, M=N,andson+1=1+4nforallne N. LetM’ ={ne 
N | n+m=m-+nfor allm € N}. We have already proved that 1 € M’. Letn € M’. 
Then, m+nt = (m+n)*t (by def) = (n+m)* (forn € M’) = n+m* (by def) = 
n+(m+1)=n+(14+m) (for 1 €M’) = (n+1)+m (by (i)) = nt +m. Thus, 
nt € M’ and hence M' =N. 

(iv) Take M = {re N|m-(n+r)=m-n + m-r forall m,n € N}. Since 
m-(n+1) = m-n* = m-n+m(bydef) = m-n + m-1, it follows that 1 € M. 
Suppose that r ¢ M. Then, m- (n+ rt) = m-(n+r)* (by def) = m-(n+r) + 
m (by def) = (m-n+m-r) +m(forr eM) = m-n+ (m-r +m) (by@)) = 
m-n + m-r* (by def). This shows that r+ € M. By Ps,M = N. 

(vii) LettM = {nEN|n+m#nforallme N}.1e€M,forl+m=m+4+1l1= 
mt # 1 (byP,). Suppose thatn € M.Then,n*+m = n* implies that (m+n)* =n". 
By P3, m+n =n, a contradiction to the supposition that n € M. This shows that 
nt €M.ByPs,M=N. 

(ix) Let M = {m € N | n-m = n-r implies that m = r}. Suppose thatn-1 = n-r. 
If r # 1, then, by Proposition 3.1.2, there is aq € N such that r = qt. But, then 
n=n-qt =n-q-+n. This is a contradiction to (vii). This shows that 1 € M. 
Suppose that m € M andn-m* =n-r. If r = 1, then from the earlier argument, 
mt = 1,acontradiction. Suppose that r 4 1. Then, again, r = g* for some q € N. 
Now, n-mt =n-q* implies thatn-m+n=n-q-+n. By (viii) n-m =n-q. Since 
mé€M,m=qandsom* = q* =r. Thus, m* € M. By Ps, M =N. tt 
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3.2. Ordering in N 


We define relations ‘<’ (called ‘less than’) and ‘<’ (called ‘less than or equal to’) 
on N as follows: 

<= {(a,b) €NxN| a+c=bD for some c € N}, 
and 

<=<UA ={(a,b)EeNxN| a=bora+c=b for somec €N}. 

We write ‘a < b’ to say that (a, b) €< and ‘a < b’ to say that (a, b) €<. Thus, 


a < bif and only if a+c=b for somec EN, 


and 
a<b <= a= bora+c = bfor someceéeN. 


Proposition 3.2.1 (i) a AaforallaeN. 
(ii) a < bimplies that b £ a for alla,b €N. 


(iii) [a < band b < c] implies that a < c foralla,b,c €N. 


Proof (i) By Theorem 3.1.6 (vii), it follows that there is no c € N such that 
a+c=a. Hence, by the definition, a £ a. 

(ii) Suppose thata < bandb < a. Then, by the definition of <, there exist c,d € N 
such that b = a+canda=b-+d. But, then b = b+ (c+d). This contradicts 
Theorem 3.1.6 (vii). 

(iii) Suppose that a < b and b < c. Then, there exist elements u,v € N such that 
b=a+uandc=b+v. But, thenc = a+ (u+v) for some u, v € N. By the 
definition of <, a <c. tt 


Remark 3.2.2 The above proposition implies that the relation ‘<’ is nonreflexive, 
nonsymmetric, antisymmetric, and transitive. 


Corollary 3.2.3. The relation ‘<’ is a partial order in N. 
Proof Follows from Proposition 3.2.1. tt 


Theorem 3.2.4 (Law of Trichotomy) Given a,b € N, one and only one of the 
following holds. 


(i) a=b 
(ii) a<b 
(iii) b<a 
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Proof It follows from the Proposition 3.2.1 (i), Gi) that at most one of the above 
conditions will be satisfied for any pair a,b € N. We have to show that at least 
one of the above three conditions hold for any pair a,b € N. Let M = {a EN | 
foranybée N, a=bora < borb < a}.Letb €N. Ifb ¥ 1, then by Proposition 
3.1.2,b = ct = 1+ c for some c € N. By the definition, 1 < b. This shows that 
1 e€ M. Leta € M. Given anyc € Na =cora < corc < a.LetbeN. 
If b = 1 then as before a = bor b < a. Suppose that b ¢ 1. Then again by 
Proposition 3.1.2, b = c* for some c € N. Sincea € M,a=cora < corc < a. 


Ifa =c, thenat = ct = b.Ifa < c, thenat < ct = b,andif 
c < a, thenb = c* < at. This shows that at ¢ M. By Ps,M = N, and the 
proof is complete. tt 


Corollary 3.2.5 ‘<’ is a total order inN. 
Proof Follows from the law of trichotomy. tt 
Theorem 3.2.6 (Well-ordering Property of N) (N, <) is a well-ordered set. 


Proof We know (Corollary 3.2.5) that (N, <) is a totally ordered set. Thus, it is 
sufficient to show that every nonempty subset of N has the least element. Let S 
be a nonempty subset of N. Lett M = {a e¢ N | a < x forall x € S}. Since 
S £%, there is an element a € S. But, then at ¢ M, for at £ a. Thus,M 4N. 
Since | < x for all x € N, 1 € M. Now, there is an element b € M such that 
bt ¢€ M, for otherwise, by Ps, M = N. Clearly, b < x for all x € S. The proof 
will be complete if we show that b € S. Suppose that b ¢ S. Then by the law of 
trichotomy, b < x for all x € S, and so bt < x for all x € S. This contradicts the 
fact that b+ ¢ M. t 


Corollary 3.2.7 (Second Principle Of Induction) Let M be a set of natural numbers 
such that 


(i) 1 eM, and 
(ii) {x EN |x < n} CM implies thatn € M. 


Then, M =N. 


Proof Suppose that M ¢ N. Then, S = N—M # &. By the well-ordering property 
of N, S has the least element 7 (say). Since 1 ¢ S, n 4 1.Sincen € S, n ¢ M. Let 
x € Nandx < n.Thenx ¢ S = N-—M, for n is the least element of S. Thus, 
x < nimplies that x € M. This means that {x e N |x < n} CM whereasn ¢ M. 
This is a contradiction to the hypothesis. tt 
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Exercises 

3.2.1 Show that (x+)t 4 x for allx EN. 

3.2.2 Prove the remaining part of the Theorem 3.1.6. 

3.2.3 Letn,m €N such thatn-m = 1. Show thatn = 1 andm = 1. 
3.2.4 Show that m < m-n for allm,n €N. 


3.2.5 Let f be an injective map from A to B. Suppose that A is infinite. Show that B 
is also infinite. 


3.2.6 Show that a subset of a finite set is finite. 
3.2.7 Show that every successor set is infinite. 


3.2.8 Let f be a surjective map from A to B. Suppose that B is infinite. Then show 
that A is also infinite. 


3.2.9 Let f be an injective map from X to X. Suppose that a ¢ f(X). Let g be a map 
from N to X such that g(1) = aand g(n*) = f(g(n)). Show that g is injective. 


3.2.10 Let X be an infinite set. Show that there is an injective map from N to X. 


3.2.11 Let n € N. Show that A, = {r € N |r <n} is finite. 
Hint. Use induction on n. Observe that there is a bijection from A, to A,41 — {r}, 1 < 
r<n+l. 


3.2.12*. Let X be an infinite set. Show that there is a bijection from X x X to X. 


3.2.13 Suppose that there is an injective map from /\ to X. Let {f, : Aa —> X | 
a € /\} be a family of injective maps such that for alla, 6 € A, fa/(Aa(\Ag) = 
f3/(Aa (| Ag). Show that there is an injective map f from L),. \ Aa to X such that 
f/Aa = fo for all a. 


3.2.14*. Give a bijective map from N x N to N. 


3.2.15 Call a set A to be countable if there is a bijective map from N to A. Show 
that if A and B are countable then A x B is also countable. 


3.2.16 Show that P(N) is not countable. 
3.2.17 Show that finite union of countable sets is countable. 


3.2.18*. Show that countable unions of countable sets are countable. 


62 3 Number System 


3.3 Integers 


We have observed in Sect. 3.1 that an equation a + x = bneed not have any solution 
in N for a, b € N. However, if it has a solution, then there is a unique solution. Our 
aim is to enlarge the system (N, +, -) so that equations a + x = bin the enlarged 
system has always a unique solution. The solution of a + x = b will of course 
depend on the pair (a, b). It is natural temptation to consider the setX = NxN 
for the purpose. The solutions of a + x = bandc + x = d should be same if 
and only ifa + d = b + c. Thus, we should identify the pairs (a, b) and (c, d) 
whenever a + d = b + c. We therefore define a relation ~ on X by 


(a,b) ~ (c,d) if and onlyifa+d=b+ ec. 


Using the property of + in N, we can verify that the relation ~ is an equivalence 
relation on X. The equivalence class determined by (a, b) is denoted by (a, b). Thus, 


(a,b) = {(c,d) ENxN| (a,b) ~(c,d} = {(c,d) eNxN|at+d = b+c} 


and 


(a,b) = (c,d) if and only ifa+d = b+c. 


Definition. The equivalence class (a, b) is called an integer, and the quotient set 
X/~= {(a,b) | (a,b) € N x N} is called the set of integers and is denoted by Z. 


Binary operations in Z. 


Proposition 3.3.1 Suppose that (a,b) = (u,v) and (c,d) = (w,x). Then 


(a+c,b+d) = (u+u,v+x), 


(ac + bd,ad+bc) = (uw + vx, ux+ vw). 
Proof Under the hypothesis of the proposition,a+v = b+uandc+x = d+w. 
But, then 
ate+tvt+x=b+d+u+w 


Thus, 


(a+c,b+d)=(u+uwuw,v4+x). 


Similarly, 


(ac+ bd,ad+bc) = (uw + vx, ux+ vw). tt 
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The above proposition allows to define binary operations @ and x in Z as follows: 


(a,b) ® (c,d) = (a+ c, b+ d) 


and 


(a,b) * (c,d) = (ac + bd, ad + bc) 


Theorem 3.3.2 The triple (Z, ®, *) has the following properties: 


(i) ® is associative in the sense that 


(x®y) Oz = xOWOC2) 


forallx,y,z€ Z. 


(ii) ® is commutative in the sense that 


xOy = yO@x 


forall x,y € Z. 


(iii) There is a unique element in 0 € Z such that 


O@®x=x=x@0 


forall x € Z. 


(iv) For all x € Z, there is a unique element (—x) € Z such that 


x@®(—-x) = 0 = (-*x) Ox 


(v) * is associative in the sense that 


(x*y)*Z = X*(y*z) 


forallx,y,z€ Z. 
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(vi) * is commutative in the sense that 


X*ey = yuu 


forall x,y € Z. 


(vii) There is unique element | € Z such that 


lex =x= xxl 


forall x € Z. 


(viii) * distributes over ® in the sense that 


xXx (y@z) = («x y) B(x * Zz) 


and 


(xBy)*z = (X*z) B(*zZ) 


forall x,y,z € Z. 


Proof We prove some of them and leave the rest as exercises. 
(iii). Sincea+1=1+a, (1,1) = (a,a) for allae N. TakeO = (1, 1). Let 
x = (a,b) € Z. Then 


x®@0 = (a,b 6,1) = (a+1,b4+1) = Gb) = x, 
fora+1+b= b+1+a. 


Similarly,0 @x = x for allx € Z. 
Suppose that there is an element 0 € Z such that 


for all x € Z. Then 


This proves (iii). 
(iv) Letx = (a,b) € Z. Take —x = (b, a). Then 


x® (-x) = (a,b) @(b,a) = (a+b,b+a) = U1) = 0. 


Similarly, (—x) @x = 0. 
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Suppose next that there is an element y € Z such that y @x = 0. Then 

y = y@0 = yO @(-x)) = CW @x) @ (-x) = 0G (-x) = (-x). 
This proves (iv). 


(vii) Note that (1 + 1,1) = (a+ 1,a) for alla € N. We denote (1 + 1, 1) also 
by 1. Then 


Ix(a,b) = (a-A1+1)+b6-1a-1+b-(4+1)) = (a,b) 
Further, if 1 also satisfies the same property, then 
Letel=1 


This proves (vii). t 


The unique element 0 is called the zero of Z or the additive identity of (Z, ®). 
The unique element | is called one or the multiplicative identity of Z. The element 
—x is called the additive inverse of x or the negative of x. The element x @ (—y) 
will be denoted by x — y. 


Corollary 3.3.3. Cancellation law holds for ® in the sense that 
x®y=x@Oz = > thaty = z, 


and 
y@x = z@x = thaty =z 


Proof Suppose thatx ® y = x @z. Then 
y = 06y = (-x) Ox) Oy = (-XN)OWOyY) = (—-*) 9H OZ) = 
((-x)@x)@z = 0@z =z 


Similarly, the second part follows. ft 


Corollary 3.3.4 The equation a @X = _ b, where X is unknown has a unique 
solution in Z for all a, b € Z. 


Proof Check that —a @ b is a solution of the equation. The fact that the solution is 
unique follows from the cancellation law for @. ft 


Corollary 3.3.5 (i) —(—x) = x 
(ii) -~@®y) = -x -y 


(iii) xxO = 0 = Oxx and 
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(iv) x*(—y) = —@xy) = (—x) ey 
forall x,y € Z. 
Proof (i) Sincex ®@ (—x) = 0 = —x@x,-—(-—x) = xforallx € Z. 
(ii) Using the associativity and commutativity of @ in Z, 
(x @y) ®(-x-y) = *O-x) OW O(-y)) = 080 = 0 
Hence —(x® y) = —x—y. 


(ii) (Oxx) BO = Oxx = (OPO) xx = (O%x) G Onx). 
By the cancellation law for @,0 = 0x. Similarly, x * 0 = 0 for all x € Z. 
Similarly, we can prove (iv). ft 


Corollary 3.3.6 Let x,y € Z. Then 
xX*y = Oimplies thatx = Oory = 0. 


Proof Suppose that x = (a,b) 4 0. Then a ¥ b. By the law of trichotomy in N, 
a < borb < a. Suppose thata < b. Then, there is an element u € N such that 
a+u= b. Suppose that (a, b) x (c,d) = 0. Then, (ac + bd,ad+ bc) = 0. This 
means that ac + bd = ad + bc. Substituting b = a+ u and using cancellation law 
in N, we find that c = d, and so (c,d) = 0. Similarly, if b < a, we can show that 
(c,d) = 0. tt 


Corollary 3.3.7 The restricted cancellation law holds for x in Z in the following 
sense. 
Ifx #Oandxxy = x*z, theny = z, 


and 
ifx #40 andyxx = zx*x, theny = z. 


Proof Suppose that x A O andx*«y = xz. Then 
X*ey —X*Z7= 0 = x«(y—-Z) 


From the previous corollary, it follows that y— z = 0. Hence y = z. Similarly, the 
second part follows. ft 


Embedding of N in Z 
Define a map f from N to Z by f(n) = (n+ 1, 1). Then, f is injective (verify). Also 
it is easy to check that 


faatm) = fr) fm) 
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and 


f(n-m) = f(n) xf (m) 


Thus, f is an operation preserving injective map. Such a map is called an 
embedding. Also observe that f(1) = 1. 

Let (a, b) € Z, a, b € N. By the law of trichotomy in N, one and only one of the 
following holds: 


(i)a=b 
(ii) a < b 
(iii) b < a 


Ifa =), then (a,b) = 0 € Z. Suppose that a < b. Then, there is an element 
c € Nsuchthata+c = b. In this case 


(a,b) = @atc) = 0,1 +e) = -C+1,1) = -fO), 


where f is the embedding of N in to Z defined above. Thus, in the case (ii) —(a, b) € 
FN). Similarly, in case (iii), we find that (a,b) € f(N). It follows that for any 
member x € Z, one and only one of the following holds: 


Gj) x = 0 
(ii) x € f(N). 
(iii) —x € f(N). 
Since f is an injective map which preserves operations, there is no loss in identi- 
fying f(n) by n for all n in N. As such, 


Z = N\ Jto}J -N, 


where -N = {x € Z| —x € N}. Also, given any x € Z, one and only one of the 
following holds: 


Gj) x = 0 
(ii) x Ee N 
(iii) —x e N 
There is no loss in denoting the operations @ and « by the original + and - 
respectively. 


Order in Z 
We define a relation ‘<’ called lessthan and a relation ‘<’ called 
less than or equal to in Z as follows: 


<= {(a,b)€ZxZ\|b = a+c for some c € N} 
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and 


#=<| JA, 


where A is the diagonal relation on Z. 
Thus, a < bif there is an element c € N suchthatb = a+c.a<bifa < b 
or a = b. In other words, a < bif—a+beéN,anda < bif—a+beN\Uf{0}. 


Theorem 3.3.8 (Law of Trichotomy in Z) Let a, b € Z. Then, one and only one of 
the following holds: 


(i) a= b 
(ii) a<b 
(iii) b< a 
Proof Consider the element —a + b of Z. From the fact already seen, one and only 
one of the following holds: 


Gi) -a+b=0 
(ii) -a+beN 
(ili) —(—-a+b) = —b-+a belongs to N. Evidently, the result follows. tt 


Remark 3.3.9 (i) The embedding f of N in to Z is also order preserving. 
(ii) It follows from the above theorem that < is a total order, where as < is anti- 
symmetric and transitive. 
(ii) N = {x € Z| 0 < x}. Thus, N is also termed as the set of positive integers. 


Absolute Value 
The map | | from Z to N [J{0} defined by 


0 ifa=o 
lal= ya ifaeN 
—aif—aeN 
is called the absolute value on Z. We denote | | (a) by | a | and call it the 


absolute value of a. 
Theorem 3.3.10 Let a,b € Z. Then, the following hold. 


(i) |a|= Oif and only ifa = 0. 
(ii) \la || =| al. 

(iii) |a|=|-—a\. 

(iv) aS<|a\. 

(v) |ab|=|a|| 5}. 

(vi) |a+b|<|a| + |b\. 

(vii) ||a@|—|b||s|a—5|. 
(viii) |a|<|ab| forallb £0. 
(ix) |a—b| < max(a, b). 
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Proof (i), (ii), and (iii) follow from the definition itself. 

(iv). Ifa € N U{0}, then a =| a |. Suppose that —a € N. Then | a | = —a, and since 
-—a = a+ (—a + —a), it follows thata < —a =| a |. This proves the (iv). 
(v). If a = 0 or b = O, then both sides of (v) are 0. If a, b € N, then both sides are 
ab. If —a, —b € N, then again both sides are ab. Finally, if a ¢ N and —b € N, then 
both sides are —ab. This proves (v). 

(vi). If a, b € NU {0}, then both sides of (vi) are a + b. If —a, —b € N, then both 
sides are —a—b. Suppose that a € Nand —b € N. Ifa = —b, then the left hand side 
is 0, the right hand side is in N, and so the inequality holds. Suppose that a 4 —b. 
Thena+beNor—a—beN. In the first case, 


lal¢+ |bl=a-—-b=a+b—-—b—b =|a+b| +(-b-D, 


and hence the inequality holds. Similarly, in the second case also the inequality holds. 
(vil). |a|=|a—b + b\s|a—b| + |b| (by (vi). 


Hence 

|a|—|b|s|a—b| 
Similarly, | b | — | a | <| a—b |. This proves (vii). 
(viii) If a = 0, then | a | = O | ab |. Suppose a 4 0, b # 0. Then | a |,| b JE N, 
and also | ab | = | a| | b |e N. The result follows from Exercise 3.2.4. 
(ix) Let a,b € Z. Suppose thata < b. Then| a—b|= b-a < Db, for 
b= b-a-+a. tt 


Corollary 3.3.11 Leta, b € Z such that ab = | or (ab = —1). Thena = +1, b = 
l(a = +1, b = F1). 


Proof Suppose that ab = 1. Then | a || b| = 1. From the Exercise 3.2.3, it follows 
that|a|= 1 =|b|. This shows thata = +1,b = +1. tt 


Theorem 3.3.12 (Division Algorithm) Let a,b € Zand b 4 0. Then, there exists a 
unique pair (q,r) € Z x Z such that 


a=bq+yr, 


whereQO <r <|b|. 


Proof We first prove the existence of the pair (g, r) with the required property. If 
a = bq + r,whereO <r <|b|, thena = (—b)(—q) + r, where r again has 
the property 0 < r <|b|=|-—b |. Hence, we can assume that b € N. Further, 
ifa =0,thenO = b-0O + O, and there is nothing to prove. Suppose that a # 0. 
Ifa = bg + randr =0,then-a = b(—-qg) + 0.1f0 < r < Db, then 
—a = b(—q-—1) + b—r,where0 < b—r < b. Therefore, without any loss, we 
can assume that a € N. The proof is by induction on a. If b = 1 thena = 1-a + 0, 
and there is nothing to do. If 1 < banda =1,thena = b-0 + a. Thus, the 
result is true for a = 1. Assume that the result is true fora, anda = bq + r, where 
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O<r < b.Thena+1 = bg + r+1,wherer+1<b.Ifr+1 < b, then there 
is nothing to do. Ifr+1 = b,thena+1 = b(q+1) + 0. Thus, we have proved 
the existence of a pair (q, r) with the required property. 

Now, we prove the uniqueness of the pair (g, r). Suppose that 


a=ba+n=bo+n, 
where 0 <r, <|b|andO<r, <|b| Then 
ba -@m=n- nN 
Ifr; # ro, then 
lb|<|b||q — q2|< max(r2,7r1) <|b| 


This is a contradiction. Hence 7; = 7. Sinceb #0, qi = q. tt 


Alternate proof for the existence of a pair (q, r): 

Consider the set X = {a — bq | bg < a} C NU{0}. It is easy to show that X 4 4 
dfb < a, takeq=1, and if b > a, then take q = —1).If0 € X, thena = bq 
for some gq and there is nothing to do. If 0 ¢ X, then X is anonempty subset of N. By 
the well-ordering property of N, X has the least element r (say). Thena = bg + r, 
where r <|b |, for otherwise r— |b|= a — bq— |b |e X isacontradiction to 
the choice of r. tt 


Remark 3.3.13 The first proof for the existence of the pair (g,7) is algorithmic 
whereas the alternate proof is the existential proof. The integer g is called the quotient 
and r is called the remainder obtained when a is divided by b. 


Leta € Z, a #0. Leth € Z. We say that a divides b if there exists ac € Z such 
that b = ac. We use the notation a/b to say that a divides b. SinceO = a-0,a 
divides 0. Again, sincea = 1-a = —1- (—a), l/a and —1/a for alla € Z. If 
b/a for all a € Z, then, in particular, b/1. Hence, there is an element c € Z such 
that 1 = bc. It follows from the Corollary 3.3.11 that b = +1. This shows that 1 
and —1 are the only integers which divide each integer. The elements 1 and —1 are 
called the units of Z. Thus, units divide each integer. 

The relation ‘divides’ is reflexive and transitive, but it is neither symmetric nor 
antisymmetric (verify). 

Suppose that a/b and b/a. Then, there exist c,d € Z such thatb = ac and 
a = bd. But, then 


b-l=b=b-d-c 


By the restricted cancellation law cd = 1. Hence c, d are units. Conversely, if a 
and b differ by a unit, then a/b and b/a. Thus, a/b and b/a if and only if a and b differ 
by a unit. 
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We say that a, b € Z are associate to each other if a/b and b/a or equivalently 
a = ub, whereu = +1 is aunit. 
We have the relation ‘~’ called “is associate to” on Z* = Z — {0} defined by 


~= {(a,b) € Z x Z* | a/b and b/a} 
Thus, a ~ b if and only if a and b differ by a unit. 
The following proposition is immediate. 
Proposition 3.3.14 The relation ‘~’ is an equivalence relation on Z’. tt 


Givena € Z*, 1, —1,a, —aare divisors of a. They are called the improper divisors 
of a. Other divisors of a are called the proper divisors of a. 

An integer p ¢ {0, 1, —1} is called an irreducible integer if it has no proper 
divisors. 


Example 3.3.15 2 = 1+ 1 is an irreducible integer. 


Proof Suppose that2 = a-b, a, b € Z. We have to show that a or b is a unit. 


Clearly, a and b are nonzero, and so | a | and | b | are in N. Further,2 =|a|-|)b|. 
Suppose that | a | # 1 and | b|¥ 1. Then, since 1 is the only element in N which 
is not successor of any element in N,| a |= n+1and|b|= m+1 for some 


n, m EN. But, then 
14+1=22= (n4+1)-(m4l=mintnan-m+ti 


By the cancellation law 1 = n + m + n-m. This means thatn < 1 whichisa 
contradiction to the fact that | is the least element of N. tt 


3.4 Greatest Common Divisor, Least Common Multiple 
Let a,b € Z*. An element d € Z is called a greatest common divisor (g.c.d) or 
greatest common factor (g.c.f) of a and b if 


(i) dla, d/b, and 
(ii) (d’/a and d'/b) implies that d'/d. 


An element m is called a Least common multiple (l.c.m) if 


(i) a/m, b/m, and 
(ii) (a/m’ and b/m’) implies that m/m’. 


Proposition 3.4.1 If d, and dz are the greatest common divisors of a and b, then 
d, ~ dy. If m, and my are least common multiples of a and b, then m, ~ my 
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Proof Since d,; and d) are both assumed to be the greatest common divisors, d /d2 
and dz/d;. Hence d; ~ d. Similarly, any two least common multiples are associates 
to each other. tt 


Remark 3.4.2 It follows from the above proposition that a positive (an integer which 
is in N is also called a positive integer and if its negative is in N, then it is also called 
a negative integer) greatest common divisor (least common multiple) is unique and 
some times this is called the greatest common divisor (least common multiple). 


A g.c.d of a and b is usually denoted by (a, b) and a |.c.m is usually denoted by 
La, b]. 


Theorem 3.4.3 Let m be a least common multiple and d be a greatest common 
divisor of integers a and b. Thena-b~m- d. 


Proof Since a/a-b and b/a-b, m/a- b. Suppose thata.-b = m- u. Since m is 
l.c.m of a and b, a/m. Suppose that m = a-v for some v € Z. Then 


a-b=a-v-u 


By the restricted cancellation law, b = v-u. This shows that u/b. Similarly, u/a. 
Hence u/d, and soa- b/m.-d. Next, since disag.c.dofaandb, a = d-s and 
b = d-t for some integers s and t. But, thena-b = d-s-t-d.Now,a/d-s-t 
and b/d-s-tand hence m/d-s-t. Thus,m-d/a-b. Hencea-b~ m.-d. tt 


Proposition 3.4.4 Leta, b € Z*. Suppose thata = b-q + r, where q,r € Z. 
Then d is a g.c.d. of a and b if and only if it is a g.c.d. of b and r. 


Proof Follows from the fact that the set of common divisors of a and b is same as 
the set of common divisors of b and r. tt 


Theorem 3.4.5 (Euclidean Algorithm) Let a,b € Z*. Then a greatest common 
divisor of a and b exists. If d is a greatest common divisor of a and b, then 
there exist u,v € Z such that 


d=u-a+uo-b 


Proof We prove it by induction on min(| a |,| 5 |). If min(| a |,| b |) = 1, 
then | a |= |b |= 1. In this case, 1 and —1 are the greatest common divisors of 
a = +landb = +1, and there is nothing to do. Assume that the result is true for 
all those pairs c, d for which min(| c |,| d |) < min(| a |, |b |). Then, we have to 
prove the result for a,b. If | a | =| b |, thena = +b. In this case, a and —a are 
the greatest common divisors, and there is nothing to do (a = 1-a + 0-band 
—a = —1-a + 0-b). Suppose that | b|= min(|a|,|b|) <|a|.Ifb/a, then b 
and —b are the greatest common divisors of a and b, and there is nothing to do. If b 
does not divide a, then by the division algorithm there exist g, r € Z such that 
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a=b-q+yr, 


where 0 < r < |b |. By the induction hypothesis, b and r have a g.c.d, and hence 
by Proposition 3.4.4, a and b also have a g.c.d. Further, the set of greatest common 
divisors of a and b are same as those of b and r. Let d be a greatest common divisor 
of a and b. Then, d is also a greatest common divisor of b and r. By the induction 
hypothesis, there exist u, v € Z such that 


d=u-b+uv-r=u-b+0-(a—b-q) =v-a+ (u-v-q)-b 


This completes the proof. tt 


Remark 3.4.6 The representation of a greatest common divisor d of a and b as 
d = u-a+ov-bisnotunique for,u-a+u-b = (u+x-b)-a+ (v—x-a)-b 
for all integers x. 


Corollary 3.4.7 Suppose that a/c and b/c. Then a- b/c - d, where d is g.c.d of a 
and b. 


Proof By the Euclidean algorithm there exist u,v € Zsuch thatd = u-a + v-b. 
Hencec:d = u-a-c + v-b-c. Since b/c and a/c, it follows thata-b/c-d. ff 


Corollary 3.4.8 Let a,b € Z*. Then l.c.m of a and b exists. Suppose thata = d-u 
and b = d.-v, where d is g.c.d of a and b. Then greatest common divisor (u, v) of 
uand v is a unit and d-u-v is a least common multiple of a and b. 


Proof If d' isa g.c.d of u and v, then d-d’/a and d-d'/b. But, then, d-d’/d, and so 
d' is a unit. Now, we show thatm = d-u-v isa least common multiple of a and 
b. Clearly,a = d-uandb = d.-v divide m. Suppose that a/m' and b/m'. Then 
m = a-kandm' = b-Iforsomek,! € Z. Thus,m’ = d-u-k = d-v-l. By 
the restricted cancellation law u-k = v-1. Nowwand v both divideu-k = v-l. 
Since (u, v) ~ 1, it follows from the above corollary that u - v divides u - k. Hence, 
m = duv dividesd-u-k = m’. This proves that m is l.c.m of a and b. tt 


Illustration of the Euclidean algorithm 
Now, we describe and illustrate the Euclidean algorithm to find the positive greatest 
common divisor d of a pair of integers a, b and also a pair of integers uw, v such that 


d=u-a+uo-b 


Let a and b be nonzero integers. By the division algorithm, we can find integers 
q and r such that 
a=b-q+yr, 


whereO <r <|b|.Ifr = 0, then (a,b) ~ b = 0-a + 1-b. Suppose that 
r & (0. Then (a, b) ~ (b,r). By the division algorithm we can find q; and 7; such 
that 
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b=q-rt+n. 
where 0 <7, <|r|.Ifr; = 0, then 

(a,b) ~ (br) ~ r = 1-a+ (-q)-b. 
Suppose that r; 4 0. Then 
(a,b) ~ (br) ~ (rer) 

By the division algorithm, again we can find gz and rz such that 

r=@Q@: i +h, 
where 0 < 12 < ry < r <|b|.Ifr. = O, then, 
@b)~On~@an~n = -q)-r+ 6b = (-q1)-a+ @-qg + 1-0. 


If r. 0, proceed further. This process stops after finitely many steps giving us a 
greatest common divisor d and integers u and v such that 


d=u-a+u-b 
As an example, we find g.c.d of 578 and 250 and also find integers u and v such 


that 578u + 250v = (578,250). Now, 
578 = 250 -2+78 


250=78 -3+ 16 
78=16-4+14 
16=14-14+2 
14=2-7+0 

Thus, (578,250) ~ 2. Also, 

2=16-1-14 
=16—1-(78—4- 16) 
=—1-78+5-16 
==1298 45+ (250 — 3~78) 
= 5.250 — 16-78 


=5.- 250 — 16- (578 — 2 - 250) 

= —16-578 + 37- 250. 

From now onward, the notation of multiplication - will be omitted unless there is 
any confusion. Thus, a - b will usually be written as ab. 


Proposition 3.4.9 Leta, b and c be nonzero integers. Then the following hold. 


(i) (a,ab) ~ a. 
(ii) (a,b) ~ (b, a). 
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(iii) ((a,b),c) ~ (a, (b, ©). 
(iv) (ca,cb) ~ c(a,b). 


Proof (i), (i), and (iii) are simple and left as exercises. For the (iv), letd = (a, b). 
Since d/a and d/b, cd/ca and cd/cb. Hence cd/(ca, cb). Further, since c/ca and 
clcb, cl(ca, cb). Suppose that (ca,cb) ~ cu. Then cu/ca and cu/cb. But, then 
u/a and u/b, and so u/d. This shows that cu ~ (ca, cb) divides cd. t 


Corollary 3.4.10 Jf (a,c) ~ land (b,c) ~ 1, then (ab,c) ~ 1. 


Proof By the Proposition 3.4.9, we have 
(ab, c) ~ (ab, (be, c)) ~ ((ab, be), c) ~ ((a, €)b, c) ~ (b,c) ~ 1 


t 


Remark 3.4.11 Tf pis an irreducible integer, then (p, a) ~ | or (p, a) ~ p according 
as p does not divides a or p divides a. 


A pair of integers a and b are said to be co-prime if (a, b) ~ 1. 


Theorem 3.4.12 Let p 4 +1 be an integer. Then the following two conditions are 
equivalent. 


(i) p is an irreducible integer. 


(ii) p/ab implies that p/a or p/b. 


Proof Let p be an irreducible integer. Suppose that p does not divide a and it also 
does not divide b. Then (p, a) ~ 1 and (p, b) ~ 1. By Corollary 3.4.10, (p, ab) ~ 
((p, pa), ab) ~ (p, (pa, ab)) ~ (p, (p, b)a) ~ (p, a) ~ 1. Hence, p does not divide 
ab. Thus, (i) implies (ii). 

Assume (ii). Suppose that p = ab. Then p/ab. But, then p/a or p/b. Ifa = pe, 


then p = pcb. By the restricted cancellation law cb = 1. But, thenb = +1 and 
c = +1. Thus,a = +p. This shows that p has no proper divisors, and so it is 
irreducible. tt 


An irreducible integer is also called a prime integer. 


Corollary 3.4.13 Let p be a prime integer such that p/ayaz +--+ a,. Then p/a; for 
some i. 


Proof Use induction on r. tt 


Theorem 3.4.14 (Fundamental Theorem of Arithmetic) Every nonzero nonunit inte- 
geracan be written as a finite product of irreducible integers. Further, the represen- 
tation of a as product of irreducible integers is unique in the sense that if 


a = Pipr-++* Pr = PiPr-** Ph 
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where p; and P; are irreducible integers, then 


fj) r=s, 
and 

(ii) there isa bijective correspondence f from 1, 2, ....., r to itself such that p; ~ Priiy 
for all i. In other words, p; ~ p', after some rearrangement. 


Proof We first prove that every nonzero nonunit integer can be written as product 
of irreducible integers. The proof is by induction on | a |. If| a | = 1, thenaisa 
unit, and there is nothing to do. Assume that the result is true for all those integers 
whose absolute values are less than n. Let a be an integer such that | a |= n. If ais 
irreducible, there is nothing to do. If not, then a = bc, where b and c are nonunits. 
Clearly, | b |< | a | and|c |< | a |. By the induction hypothesis, b and c are 
products of irreducible elements. Hence, a is also product of irreducible elements. 


Uniqueness. Suppose that 


P = Pipas+** Pr = PAP2°+* Diss 


where p; and p; are irreducible integers. Then p;/p'p5--- - p,. It follows from the 
above corollary that p, / Di for some j. Rearranging the factors, we may assume that 
Pi/p. Since p', is irreducible and p, is a nonunit, p; ~ p. Hence 


P1P2°*** Pr = PiUpy +++ Py, 
where u is a unit. By the restricted cancellation law, 


not 


P2P3°*** Pr = PyP3-+> + Py 


From the previous argument, we may assume that p2 ~ p5 ~ p'. Again canceling 
p2 and p3, we find that p3...p, ~ p...p). Proceeding this way, we find that 
P1,P2,---,Py and p\, p5,...,p%, both will exhaust simultaneously, for otherwise 
we Shall arrive at a product of irreducible integers equal to +1. This, however, is 
impossible. Hence r = s, and after some rearrangement p; ~ p;. tt 


Proposition 3.4.15 The set of positive primes is infinite. 


Proof Let pi, p2,..-, Pn be first n primes in ascending order. Then, a = pjp2---- 
Pn + 1 is anonunit, and so from the fundamental theorem of arithmetic, there is a 
prime p which divides a. Clearly, p 4 p; for all i. Thus, the set of positive primes 
contains more than n elements for every n € N. This shows that the set of positive 
primes is infinite. tt 


Positive multiplicative integral powers of an integer 

We define a” by induction for all a € Z andn € N L{0}. 

Define a® = 1, anda! = a. Assuming that a” has already been defined, define 
a’*! = aq" .a, Itis easy to prove (by induction) the following law of exponents: 
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(i) qntm = q qm 
(ii) (a")” = qr 


Exercises 


3.4.1 Show that (5,6) = (7, 8). 

3.4.2 Show that the subtraction is not associative. 

3.4.3 Show that in Z, x-y = Oimplies thatx = Oory = 0. 
3.4.4 Show that —(x— y) = (y— x). 

3.4.5 Show that (—x)-(—y) = x-y. 


3.4.6 Show that (Z, <) is a totally ordered set in which every nonempty subset 
bounded from above (below) has largest (least) element. Deduce that it is an order 
complete set. Show also that it is not a well-ordered set. 


3.4.7 Show thata < bif and onlyifa +c < b+ cforallc €Z. 

3.4.8 Show thata <b if and onlyif —b < —a. 

3.4.9 Show that | a — b |< max(|a|,| b |), whenever a,b EN. 

3.4.10 Show that divisibility is a reflexive as well as a transitive relation on Z. 
3.4.11 Show that 3 = 1 + 1 + 1 is a prime integer. 


3.4.12 Show that if we divide an integer by 2, then the remainder is either 0 or it is 
1. Deduce that an integer is of the form 2n or of the form 2m + | but not both. An 
integer is called even if it is of the form 2n and it is called odd if it is of the form 
2n+1. 


3.4.13 Show that Z is countable in the sense that there is a bijective map from Z to N. 
Hint. Show that the map f from N to Z given by f(2n) = n, and f(2n+1) = —n 
is bijective. 


3.4.14 Suppose that m/a, n/a and (m,n) ~ 1. Show that mn/a. 


3.4.15 Let a and b be nonzero nonunit integers. Show that there exist distinct primes 
P1, P2,---, Pn and nonnegative integers a1, Q2,..., Qn, 31, G2, ..-, Gn such that 


1 2 On 


a= Py Py **** Pa 


and 
By __ Bo By 
Further, show that 
(a,b) ~ pi'py-++- py 
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and 
61.02 On 
la, b] ad P\'Py ot Das 


where 7; = min(a;, 3;) and 6; = max(aj, (3). 


3.4.16 Let a), a2, ..., a, be integers such that 


Il 
2 


O+a+--+ a 
Show that eacha; = 0. 
3.4.17 Show that | a |? = a? 
3.4.18 Show that (a,b) ~ | implies that (a", b) ~ 1. 
3.4.19 Suppose that (a, b) ~ 1. Show that a/bc implies that a/c. 


3.4.20 Show that the set of all odd integers is in bijective correspondence with the 
set Z of all integers. 


3.4.21 Find g.c.d of 238 and 55, and also integers u and v such that 238u + 55v = 
(238,55). Express it in two different ways as 238u + 55v. 


3.4.22 Let a, b, c be nonzero nonunit integers. Show that 
(a, [b,c]) = [(, }), (a,c)] 
3.4.23 Let n € N. Show that the equation 
xX -Yy=n 


has a solution in N if and only if whenever n is divisible by 2, it is divisible by 4. 
When can the solution be unique? 


3.4.24 Find all possible integral solutions of the equation 
px +4 qa he spe ts 1X +4 p = 0, 


where qj, ..., G,—1 are integers, and p is a positive prime. Determine the conditions 
on the coefficients for a possible solution. 


3.4.25 Let n be a nonzero nonunit positive integer which is not a prime(such an 
integer is called a composite integer). Show that there is a prime p such that p? < n 
and p/n. 
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3.4.26 Suppose that (m,n) ~ 1 but (m+n, m—n) ~ 1.Show that (m+n, m—n) ~ 2. 


3.4.27 Suppose that m and n are co-prime and mn is a square. Show that m and n 
are both squares. 


3.4.28 Establish the following identities using induction on n. 


G@ x” -y = Ca - yx! + ay fee $ y"—!), where n € N 
(ii) If nis odd, then, x” + y" = (x + yy! — x" y + x23 0.4 yt), 


3.4.29 Use the above identities to show that if a” — 1 is prime, thena = 2 andn 
is a prime. The primes of the forms 2? — 1 are called the Mersenne primes. It is 
not known, if there are infinitely many Mersenne primes. 


3.4.30 Suppose that a” + 1 is an odd prime. Show that a is even and n is a power 
of 2. The prime numbers of the forms 27" + 1 are called the Fermat primes. It is 
also not known if there are infinitely many Fermat primes. 


3.4.31 Suppose that n is odd. Show that 8/n* — 1. Suppose further that n is co-prime 
to 3. Show that 6/n* — 1. 


3.4.32 Show that 30/n° — n for all n € N, and for all odd number n, 120/n> — n. 


3.4.33 Let a, b,c be set of pairwise co-prime integers such that a7 + b* = c’. 


Suppose that a is even. Show that there is a pair of co-prime integers u, v such that 
a = 2uv,b = v* — wWiandc = v? + wW. 


3.5 Linear Congruence, Residue Classes 


Let m be a fixed positive integer. Let a, b € Z. We say that a is congruent to b 
modulo m, if m divides a — b. We use the notation 


a = b(mod m) 
to say that a is congruent to b modulo m. Consider the relation R on Z given by 
R ={(a,b)€ZxZ\|a = b(mod m)}. 


Thus, (a, b) € Rif and only if m/a — b. It is easy to verify that R is an equivalence 
relation on Z. The equivalence class R, determined by a is denoted by a. Thus, 


a@ = {be Z|m/a—bd} 
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From the properties of equivalence classes, we have the following: 


(i) ae aforallae Z. 
(il) a = bif and only if m/a—b. 
(iii) @ A b if and only if @(.\b = Y. 


To say that m/a— b is to say that the remainder obtained when a is divided by m is 
the same as the remainder obtained when b is divided by m. More explicitly,a@ = 7, 
where r is the remainder obtained when we divide a by m. Further, if0 < 71,72 < m, 
then 7; = 7 if and only if r, = rz. Let Z, denote the quotient set Z/R. Then, it 
follows that Z,,, contains m elements. Indeed, 


Zm = fa|aéeZ} = {0,1,2,---,m— 1} 


The set Z, is called the set of residue classes modulo m. 


Proposition 3.5.1 There are binary operations ® and x on Z» given by 


a@b=a+t+b 


and 


QI 
a 
nad 
Il 
Q 
> 


where a,b € Z. 


Proof Suppose thata@ = a’ andb = Db’. Then m/a—a' and m/b — b’. But, then 
m/(a+ b) — (a’ +b’). Hence 


a@b=a+t+b=da+b=d 6p 


This shows that @ is indeed a binary operation. Further, ab — a’b’ = ab — a'b + 
ab — dab = (a—d)b + da (b—D’) is divisible by m. This shows that@ *« b = 


a xD. tt 


The proof of the following proposition is straightforward and is left as an exercise. 


Theorem 3.5.2 The triple (Zm, ®, *) satisfies the following properties. 


(i) @@b) Ot = 4ObODO for alla, b,c € Z. Thus, ® is associative. 
(ii) a® b=b® a for alla, b € Z. Thus, ® is commutative. 
(iii) O@@ = @ = G0 foralla in Z. Thus, 0 is the identity for ®. 
(iv) a® (—a) = 0 = (—a) @Gfor alla € Z. Thus, every element a € Zy, has 
inverse —a in Z,, with respect to ®. 
(v) @@b = GPC implies that b = ©. Thus, the cancellation law holds for ® 
in Zim. 
(vi) (@xb) xt = Gx (b*Z) for all a, b,c € Z. Thus, * is associative operation. 
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(vii) ax b = bx a for alla, b € Z. Thus, * is commutative. 
(viii) Gx (b@C) = (xb) @ (xO) forall a, b,c € Z. Thus, x distributes over ® 
from left. 
(ix) (@@b)*e = (Gx) @ (b«@) forall a,b, c € Z. Thus, « distributes over ® 
from right also. 
(x) 1x@ = @ = Gx 1 foralla€ Z. Thus, | is the identity for x. ft 


An equation 
aX = b(modm), 


where a, b € Z and X is unknown is called a linear congruence. This equation can 
be read in Z,, as 
axX =b 
An integer c is said to be a solution of the above congruence if 


ac = b(mod m), 


or equivalently m/ac — b. 


Remark 3.5.3 A linear congruence aX = b(mod m) need not have any solution. 
For example 2x = 1(mod 4) has no solution in Z (for 2c — 1 is never divisible by 
4). 


The following is a necessary and sufficient condition for a linear congruence to 
have a solution. 


Theorem 3.5.4 A linear congruence aX = b(mod m) has a solution in Z if and 
only if (a, m) divides b. (Equivalently an equation GX = b in Zp has a solution in 
Zym if and only if (a, m) divides b.). Further, if c is a solution of aX = b(mod m), 
then (c, m) divides b. 


Proof Suppose that aX = b(mod m) has a solution c € Z. Then ac = b(mod m), 
and so m divides ac — b. Suppose that ac — b = qm, or equivalently ac — qm = b 
for some q. Then, any common divisor of a and m divides b, in particular (a, m) 
divides b. 


Conversely, suppose that (a, m) divides b. Letd = (a,m). Then, thereisag € Z 
such that b = qd. By the Euclidean algorithm, there exist u, v € Z such that 


ua +um=d 


Thus, 
b = qd = qua + qum 


But, then a(qu) = b(mod m). Hence, qu is a solution of the given congruence. 
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Further, let c be a solution of aX = b(mod m), then a is a solution of the 
congruence cX = b(mod m). From what we have just proved, it follows that 
(c, m)/b. tt 
Corollary 3.5.5 aX = 1(mod m) has a solution if and only if a and m are 


co-prime (i.e.(a,m) ~ 1). Further, if c is a solution, then c and m are also co- 
prime. In particular, fora € Zp, there isab € Z, such thata*xb = 1 if and only 
if (a,m) ~ 1. 


Proof (a,m)/1 if and only if (a,m) ~ 1. The result follows from the above 
theorem. tt 


Algorithm to find solutions of a linear congruence 

Two solutions X; and X2 are said to be congruent if X; = X2(mod m). The solutions 
X, and X> are said to be incongruent, otherwise. The purpose is to give an algorithm 
to find all incongruent solutions of the equation aX = b(mod m). If (a, m) does not 
divide b, then there is no solution of the congruence. 

Suppose that (a, m)/b. Let d be the positive greatest common divisor of a and m. 
Then d/a, d/m and also d/b. Suppose thata = du, b = dvandm = dw. Since 
d is ag.c.d of a and m, (u, w) ~ 1. Further, then m divides ac — m if and only if w 
divides uc — v. Thus, c is a solution of aX = b(mod m) if and only if it is a solution 
uX = v(mod w). More explicitly, the solutions of ax = b(mod m) are same as 
those of uX = v(mod w). 


Theorem 3.5.6 Jf (u, w) ~ 1, then the linear congruence 
uX = v(mod w) 


has a unique incongruent solution. In other words, there is a unique c, 0 < c < w 
such that uc = v(mod w). 


Proof By the Euclidean algorithm, we can find integers y and z such that wy + wz = 
1. But, then uyv + wzv = v. Hence u(yv) = v(mod w). Let c be the remainder 
obtained(division algorithm) when we divide yu by w. Then0 < c < w and 


uc = v(mod w). Let cy, co, O < cy < wand0 < cy <_ w be such that 
uc} = v(mod w) and ucp = v(mod w). Then uc; = uc2(mod w), and so 
w/uc, — ucz. Since (u, w) ~ 1, w/c; — co. This shows that cy; = cp. tt 


Theorem 3.5.7 Letd be a positive g.c.dofaandm which divides b. Leta = du, b = 
dv, andm = dw. Let c be the unique solution of the congruence uX = v(mod w), 
whereO < c < w.Thenc, c+w,c+2w,---,c + (d— lw are precisely the 
least positive incongruent solutions of aX = b(mod m). In particular, there are d 
incongruent solutions of aX = b(mod m). Any solution of the given congruence is 
of the formc + iw + qm for some i,q, where0 < i < wandq eZ. 


Proof Clearly,c, c + w, c + 2w,---, c¢ + (d— 1)w are solutions of uX = 
u(mod w), and so also of aX = b(mod m) which are pairwise incongruent modulo 
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m. It is sufficient to show that any solution of aX = b(mod m) is congruent to one 
of the above solutions. Let / be a solution of aX = b(mod m). Then, it is also a 
solution of uX = v(mod w). From the previous theorem, / = c(mod w). Suppose 
that! — c = kw. By the division algorithm,k = qd + r, whereO<r<d-—l. 
Thus, / — c = rw + qm. This shows that] = (c + rw)(mod m). tt 


We illustrate the above algorithm by finding the least positive incongruent solu- 
tions of the congruence 
51x = 12(mod 87) 


The positive greatest common divisor(use Euclidean algorithm) of 51 and 87 is 
3 which divides 12. Thus, the solutions of 51X = 12(mod 87) are same as those 
of 17X = 4(mod 29). Further if c is the least positive incongruent solution of 
17X = 4A(mod 29), thenc, c + 29, c + 58 is the complete list of least positive 
incongruent solutions of 51X = 12(mod 87). Since (17, 29) ~ 1, using Euclidean 
algorithm we find integers 12 and —7 such that 


1= 12x17 — 7x 29 


Hence4 = 48x 17 — 28 x29. This shows that 48 is a solution of 17X = 4(mod 29). 
The least positive solution is obtained by dividing 48 by 29 and taking the remainder. 
Thus, 19 is the least positive solution of 17X = 4(mod 29). Therefore, The complete 
list of least positive incongruent solutions of 51X = 12(mod 87) is {19, 48, 67}. 


Linear Diophantine Equations 
An equation 
aX; + aX, + ---+ aX, = b, 


where a, d2,--- ,a;, b are integers and X1, X2,--- , X, are unknowns, is called a 
linear diophantine equation in r variables. Solutions of this equation are to be 
determined (if they exist) from the set of integers. For simplicity, we consider linear 
Diophantine equations 

aX + bY =c 


in two variables only. We are interested in solving this equation in Z. Geometrically, 
this means finding out lattice points (points in Z x Z) on the straight lineaX +bY = c. 
Solutions of the equation. 


aX + bY =c (3.5.1) 


If the above Eq. (3.5.1) has a solution, then g.c.d of a and b divides c. Let d be the 
positive greatest common divisor of a and b. Suppose thata = du, b = dv, c = 
dw. Then the solutions of (3.5.1) are same as those of 


uX + vY = w (3.5.2) 
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Clearly, (u, v) ~ 1. By the Euclidean algorithm, we can find integers m, n such that 
um + vn = 1 


But, then 
umw + vnw = W 


Thus, x) = mw and yo = nw give a solution of (3.5.2), and so of (3.5.1). Further, 
since x9, yo 1s a solution of (3.5.2), for any integer t, x» + tv, yo — tu also 
constitutes a solution of (3.5.2) (check it). Conversely, if x’, y’ also constitutes a 
solution of (3.5.1) and so of (3.5.2), then 


uxy + vyyg = w= ux’ + vy, 


and so 
u(x’ — xo) + v(y'—yo) = 0 


Since (u,v) ~ 1, u/y' — yo and v/x' — xo. Suppose that x — x9 = tv. Then 
utv + v(y'—yo) = 0,andsoy’—yo = —tu. Thus,x’ = x) + tuandy = yo—tu 
for some integer f. It follows that the set 


oe 2 1“) |t eZ} 
XO qd’? d 


is precisely the set of all solutions of the equation (3.5.1). 
To illustrate the above algorithm, we find all solutions of the equation 


S51X + 87Y = 12 


The positive greatest common divisor of 51 and 87 is 3 which divides 12. Thus, 
it has a solution and its solutions are same as those of 


17X + 29Y = 4 
By Euclidean algorithm, we find that 
17 x 48 + (—28) x 29 = 4 
This gives us a solution (48, —28) of the given equation. Thus, the set {(48 + 


29t, —28 — 17t) | t € Z} is precisely the set of all solutions of the above equa- 
tion. 
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Exercises 


3.5.1 Let p be a prime which does not divide a. Show that there is an integer b such 
that ab = 1(mod p). 


3.5.2 Suppose that (a,m) ~ 1. Let b € Z. Show that there is an integer c such that 
ac = b(modm). 


3.5.3 Show that the congruence 25X = 6(mod 30) has no solution in Z. 

3.5.4 Find the set of least positive incongruent solutions of 22X = 4(mod 18). 
3.5.5 Find the set of least positive incongruent solutions of 12X = 15(mod 9). 
3.5.6 Find out solutions of the congruence 3X + 20 = 17(mod 15). 

3.5.7 Find out the solutions of X? = 1(mod p), where p is a prime number. 


3.5.8 Let m € N. Let {a), a2,--- , dm} be a set of integers such that every integer is 
congruent to some a; modulo m. Let / € Z be such that (/, m) ~ 1. Show that every 
integer is congruent to some /a;. 


3.5.9 Consider the linear congruence 
aX + bY = c(mod m) 


Show that it has a solution if and only if (a, b, m) divides c. Call two solutions x, y, 
and x2, y2 congruent if x2 = x,;(mod m) and y2 = y,(mod m). Find an algorithm 
to find the least positive incongruent solutions of the above linear congruence. Show 
that there are dm least positive incongruent solutions of the above congruence, where 
d is the positive g.c.d of a, b, m. 


3.5.10 Generalize the above exercise to the congruence in n variables. Show that 
there are m’—~!d incongruent solutions, where d is positive g.c.d of a), d2,--+ , Gn 
which divides b. 


3.5.11 Determine least positive incongruent solutions of 3X + 7Y = 8(mod 15). 


3.5.12 Find a general solution of the linear Diophantine equation 641X + 372Y = 
1254. 


3.5.13* Let a, b € N and (a, b) ~ 1. Let r be the smallest nonnegative integer such 
that aX + bY = nis solvable in N for alln > r. Show thatr = (a— 1)(b— 1). 
Show further that for half of the nonnegative integers < r, the equation will have 
a solution in N and for other half not. (This is a particular case of the Frobenius 
problem in two variables.) 

Hint. Look at the lines parallel to aX + bY = O and see when it always passes 
through a lattice point in the first quadrant. 
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3.6 Rational Numbers 


This is a brief section in which we introduce the rational number system Q and their 
properties through a set of graded exercises. 

The equation aX = b, where a,b € Z need not have any solution in Z. For 
example, 2X = | has no solution in Z (prove it). The purpose of this section is to 
enlarge the system of integers to a bigger system Q of rational numbers in which we 
can solve these equations whenever a # 0, and without loosing any other algebraic 
properties of Z. 

The verification of most of the claims in this section is simple, straightforward 
(the proofs are similar to those in the introduction of integers from natural numbers) 
and will be left as exercises. 

The solution of the equation aX = b, where a # 0 depends on the pair (b, a). 
This prompts us to consider the set X = Z x Z*. Since the solution of ax = band 
that of caX = cb,c # 0 are same, we define relation to identify pairs (b, a) and 
(bc, ca) for all c € Z*. More precisely, define a relation ~ on X by 


~= {((a,b), (c,d)) €X x X | ad = be}. 


3.6.1. Show that ~ is an equivalence relation on X. 


The equivalence class determined by (a, b) will be denoted by 5 Thus, 


; = {(c,d)€X|ad = be} 


a is called the numerator and b is called the denominator. Clearly, 


: a 5 if and only if ad = be 
Let Q denote the quotient set X/ ~ and call it the set of rational numbers. 
3.6.2. Show that we have two binary operations on Q denoted by + and - which 


are given by yas 
a c ad + bc 


ar hai ma 
and 
a Co =, ee 
b od bd 


3.6.3. Show that for every nonzero member x of Q, there is a unique pair a, b, 
a € Z,b € Nand (a,b) ~ 1 such thatx = ; 


3.6.4. Show that the map f from Z to Q defined by f(a) = - is embedding in 


the sense that it is injective and respects addition and multiplication. 
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We shall identify a and f(a) = “ 


3.6.5. Show that the order < in Z can also be extended to that in Q by defining 
z < §(b,d €N)ifad < bc. Check that it is a total order. Show also that the law 


of trichotomy holds here also. 
The proof of the following theorem is left as an exercise. 


Theorem 3.6.1 (Q,+,-, <) is an ordered field in the sense that it satisfies the fol- 
lowing conditions: 


(i) + is associative. 
(ii) + is commutative. 
(ii) x +0 =x=04 xforallxeQ 


—a 


(iv) For allx = : € Q, there exists —x = = € Q such that 
x+—-x =0O= -x4x. 


(v) - is associative. 
(vi) - is commutative. 
(vii) x-l =x = 1-xforallxeQ 


(viii) Forallx = ; €Q, a0, there exists x"! = 2 # 0 € Q (called the 


Po yay! 


inverse of x) such that x - x~ 
(ix) - distributes over + from left as well as right. 


(x) LetP = {x €Q|x > O}. Given any x € Q, show that one and only one of 
the following holds: 


(a) x EP. 
(b) x = 0. 
(c) —x EP. 


(xi) x,y € P implies thatx + yandx-y € P. 
3.6.6. Show that if x7 + x3 + +--+ x2 = 0, thenx; = 0 for alli. 
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3.6.7. Let a, b,c and d be members of Q such thata < bandc < d. Show 
that there is a bijective map from the set {x € Q | a < x <_ )b} to the set 
{x € Q|c < x < d}. Indeed, there is a bijective map from these sets to Q also. 

3.6.8. Show that Q is countable in the sense that there is a bijective map from N 
toQ. 

Hint. It suffices to show (from the above exercise) the countability of {x € Q | 
0 < x < 1}. Observe that any such rational number is expressible as “ where 
0 < m < n. Apply double induction first on m and then on n. 

3.6.9. Let p be a prime number in N. Then, X?_ = p has no solution in Q. For, 
suppose that (2)? = p, where (m,n) ~ 1. Then m? = np. But then p/m and so 
p*/n’p. This means that p/n, a contradiction. 

3.6.10. Let m € Z. Show that X* = mhas a solution in Q if and only if it has a 
solution in N () {0}. 

Hint. Use the above exercise. 

3.6.11. Show that every equation aX = b, where a € Q* andb € Q, has a 
solution in Q. 

3.6.12. Show that an equation X” + aX"! 4+ ax"? + .-- +.a,, where 
a; € Z, has a rational solution if and only if it has an integral solution. 

3.6.13. Show that Q is not order complete with respect to the order <. 

Hint. Show that {x € Q| 0 < x and x? < 2} has an upper bound but it has no 
least element. 

3.6.14. Define absolute value of a rational number as we defined for Z. Also prove 
all the properties which were proved to be true over Z. 


3.7. Real Numbers 


In this section, we enlarge the system Q to a bigger system R of real numbers which 
has all the property of Q as ordered field with an extra important property of being 
order complete. As a consequence, it will also have solutions of the equations X* = a 
foralla > 0. Here also we shall develop the system with the help of graded exercises. 
The reader may take the help of a book on real analysis, for example, ‘Principles of 
Real Analysis’ by W. Rudin, in case of any difficulty in proving the facts. 

A mapf from N to a set X is called a sequence in X. A sequence f is also denoted 
by {f(n)}. 


A sequence f in Q is called a Cauchy sequence in Q if 


foralle € Q, € > O, there exists no € N such that 


n,m > no implies that | f(n) —f(m) |< «. 


Let us denote the set of all Cauchy sequences in Q by I. 
3.7.1. Let f and g be members of I’. Show that f + g and f - g defined by 
(+g) = fm) + gm) and (f-g)(2) = fn) - gm) are also members of I’. 
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3.7.2. Show that the system (I’ + -) is a commutative ring in the sense that it 
satisfies the following properties. 


(i) + and - are associative as well as commutative. 
(ii) + has the identity 0 (called zero) given by O(n) = 0 € Qfor alln € N. Thus, 
ft+0=f=0+4 fforalfeT. 
(iii) - has the identity | given by 1(n) = 1 € Qforalln € N. Thus,f-1 = f = 1-f 
forallf eT. 
(iv) - distributes over + from left as well as from right. Thus, f-(g+h) = f-g+f-h 
forallf,g,h eT. 


Call a sequence f € IT to be a nullsequence if for alle > 0, € € 
Q, there exists no € N such that n > no implies that | f(n) |< «. Define a 
relation ~ on [I as follows: 


ft ~ gif and only if f — g is a null sequence. 


3.7.3. Show that ~ is an equivalence relation. 

Let R denote the quotient set '/~. Let us denote the equivalence class determined 
by f by f. 

3.7.4. Suppose that f ~ f’ and g ~ g’. Show thatf+g ~ f’+g/ andf-g ~ f'-g’. 
Deduce the existence of binary operations + and - on R given by 


f+g=F+9 


and _ = 
f-g=f9 


3.7.5. For each r € Q, let f denote the constant sequence given by f.(n) = 
r for all n € N. Show that the map ¢ from Q to R defined by ¢(r) = f, is an 
embedding (injective map which preserves operations). 

We further identify r in Q by ¢(r) inR. 

3.7.6. Show that the order < in Q can also be extended to that in R by defining 
f < Gif and only if there exists no € N such that n > no implies that f (n) < g(n). 

3.7.7. Show that (IR, +,-,P), where P = {x € R| x > O}, is an ordered field in 
the sense that it satisfies all the properties of Q listed in the Theorem 3.6.1. 

The following facts can be proved with a little more effort (The reader may 
consult a book on elementary real analysis, for example, “Principles of Mathematical 
Analysis” by W. Rudin, in case of any difficulty.): 

3.7.8. Show that R is order complete with respect to the order <. 

3.7.9. The concept of absolute value can be introduced on R in the similar manner 
as it was done for Q. Show that it also obeys the same laws. 

3.7.10. Define the concept of Cauchy sequence in R as we did it in Q. If we repeat 
again the process by taking R at the place of Q, then we will not be getting any 
thing bigger that IR. More precisely, show that if f is a Cauchy sequence in R, then 
there exists r € R such that f ~ f,, where f, is given by f,(n) = r, foralln éN. 
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3.7.11. A sequence f in R is said to converge to an element a € R if given any 
e > O, € € R, there is a natural number ng € N such that n > no implies that 
| f(n) —a|< €. ais called the limit of the sequence f, and we express it by writing 
limn—soof (n) = a. Show that R is complete in the sense that every Cauchy sequence 
in R converges to a unique point in R. 

We state the following facts. The proofs of these facts can be found in the book 
“Principles of Mathematical Analysis” by W. Rudin. 

1.Letn € ZandaéR, a > O. Then, the equation X” = ahasa solution in R. 
If n is an odd integer, then the equation X” = a will have a solution for all a € R. 
Thus, ai can be defined for all n € Nanda e€ R, a > 0. In turn, a’ can be defined 
forallre Qandae R, a>0. 
2. Archemidean property of IR. Given any real number a, there is a natural number 
n such thatn > a. 
3. Qis dense in R in the sense that every real number is limit of a sequence in Q. This 
is equivalent to say that between any two distinct real numbers there is a member of 
Q. 
4. The real numbers which are not rational numbers are called irrational numbers. 
Thus, the solutions of X? = 2 which are denoted by +,/2 are all irrational numbers. 
The set R — Q of irrational numbers is also dense in the same sense. 
5. Leta € R, a > Oandb € R. Suppose f is a sequence in Q which converges to 
b. Then, we have a sequence a’ defined by a(n) = a/. It can be checked that this 
is also a Cauchy sequence. The limit of this sequence is defined to be a’. The law of 
indices is true here also. 
The map x ~» a‘, a > O isa bijective map from the set R to the set R* of positive 
real numbers. The inverse of this map is denoted by /og, which is a bijective map 
from Rt to R. This map is called logarithm to the base a. 
6. We define n! for all n € N J {0} inductively as follows. Define 0! = 1 and also 
1! = 1. Assuming that n! has already been defined, definen+ 1! = (n+ 1)- al. 
Thus, n! = 1|-2----n. The sequence {1 + i + x ftoeee Ht +} can be seen 
to be a Cauchy sequence. In turn, it will converge to a unique real number denoted 
by e. This number e is called the exponential. For any real number x, the sequence 
{l + x + x +oeee+ vy is also a Cauchy sequence, and its limit is in fact e*. 


7. The sequence {x — 7 tee (-1)! =} is also a Cauchy sequence, and its 
limit is denoted by sin x. This defines a function sine from the set of reals to itself. 
Similarly, the sequence {1 — F + +--+ (—1)"35} is also a Cauchy sequence, 


and its limit is denoted by cos x. This defines a function cosine from R to R. These 
functions obey all trigonometrical identities with which the reader is familiar. 
8. The sequence {1 + a a +} is also a Cauchy sequence. The number 7 is 


defined by the requirement that a is the limit of this sequence. 

9. IR is uncountable. 

10. The equation X? + 1 = Ohas no solution in R. In fact, X? + 1 = Ohas no 
solution in any ordered field. 
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3.8 Complex Numbers 


The equation X7 + 1 = 0 has no solution in R. We wish to enlarge the system 
R to a system to include the solution of the above equation. The best that we can 
expect is to retain the field properties of IR. Obviously, the order property cannot be 
retained. Let us denote the enlarged system (if possible) by C and denote the solution 
of X*7 + 1 = Obyi. Then i? = —1. We denote the extended operations by same 
notations + and -. The set {a + bi| a, b € R} is contained in C. Ifa + bi = c + di, 
thena—c = (d—b)t. Hence if d ¢ b, theni = (a—c)(d — b)~! belongs to R 
which is impossible. Thus, b = dandalsoa = c. Clearly, 


(a+ bi) + (c+di) = (atc) + (b4+d)i, 
and 
(a+ bi)-(c+di) = (ac+bdi? + (ad+bc)i = (ac— bd) + (ad + be)i 


Thus, the set {a+bi | a, b € R} is closed under the binary operations. Further, it can be 
checked that (a+ bi) - (a*p -—i aes) = | (observe that we are identifying a € R 
by a+i0 in C). Thus, we have the required enlarged system C = {a+bi | a,b € R}. 
The following fact is known as the fundamental theorem of algebra. The proof 
of this fact will be given in the Chap. 9 of algebra 2. 


Theorem 3.8.1 Any equation 
ay + aX + aX? + +--+ aX” = 0, 


where all a; € C, has a solution (in fact all solutions) in C. tt 


Given any complex number z = a+ bi, ais called the real part and b is called the 
imaginary part of z. The complex number a — bi is called the conjugate of a+ bi, and 
it is denoted by a + bi. The real number a* + b? is nonnegative, and its nonnegative 
real square root is called the modulus of a + bi, and it is denoted by | a + bi |. The 
modulus on C satisfies the same properties as the absolute value satisfies on R. 

Cauchy sequences can be defined in C also as it was defined in R. C is also 
complete in the sense that every Cauchy sequence in C converges in C. e, cos z, sin z 
can be defined for all complex numbers z as it was defined for real numbers. It follows 
that e = cos x + isin x for all real numbers x. The set S' = {z € C || z|= l}is 
called the circle or torus of dimension 1. Thus, every member of S ! is of the form 
cos x + isin x, and so it is of the form e, where x € R. For details, refer to any 
book on Complex Analysis, for example, “Complex Analysis” by Ahlfors. 


Chapter 4 
Group Theory 


One of the most fundamental concepts in mathematics today is that of a group. 
Germs of group were present, even in ancient times, in the study of congruences of 
geometric figures and also in the study of motions in space. It started taking shape 
in the beginning of the nineteenth century. One of the most challenging problems at 
that time was the problem of solvability of general polynomial equations of degree 
n,n > Sby the field and radical operations (addition, subtraction, multiplication, and 
division by nonzero elements and taking mth roots for different m). Paulo Ruffini 
(1736-1813) and Niels Henrik Abel (1802-1829), using the structure of a set of 
permutations on the set of roots of the polynomials, proved that a general nth degree 
equation n > 5 is not solvable by the field and the radical operations. Evariste 
Galois (1811-1832) discovered that the key factor behind the algebraic solvability 
of a polynomial equation is an structure (called the Galois group of the polynomial 
equation) and proved that a polynomial equation is solvable by the field and the 
radical operations if and only if the corresponding structure (namely the Galois 
group) possesses a property (called the solvability). 

In the second half of the nineteenth century, the notion of the congruences of 
geometric objects was further generalized. The development during this period was 
influenced by the works of Sophus Lie (1842-1899), Felix Klein (1849-1925), Henri 
Poincare (1854-1912), and Max Dehn (1878-1952) on geometry and topology. The 
importance of the study of permutation groups, continuous groups, groups of home- 
omorphisms, and fundamental groups was realized, and this lead to the formulation 
of an abstract group. The notion of an abstract group is present in the works of Arthur 
Caley (1821-1895) and von Dyck (1856-1934). 

Theory of groups developed slowly but steadily in the first half of the twenti- 
eth century with some very significant contributions by G. Frobenius (1849-1917), 
William Burnside (1852-1957), Isai Schur (1875-1936), O. Schreier (1901-1929), 
P. Hall (1904-1982), and others. Theory of finite groups picked up momentum with 
the works of R. Brauer (1901-1977) and his students in 1955. Theory of groups, 
now, has tremendous applications and interest in itself. 
© Springer Nature Singapore Pte Ltd. 2017 93 


R. Lal, Algebra 1, Infosys Science Foundation Series in Mathematical Sciences, 
DOI 10.1007/978-98 1-10-4253-9_4 


94 4 Group Theory 


4.1 Definition and Examples 


Let G be a set. Recall that a map 0 : G x G —> Gis called a binary operation on 
G. The image of (a, b) under this binary operation is denoted by aob. 


Definition 4.1.1 A groupoid is a pair (G, 0), where G is a set and o is a binary 
operation on G. 


The pairs (N, bs (N, ), (Z, +), (Z, =), (Z, ), (Zn, 8), (Zn; x), Q +), 
(Q, -), (Q, —), (R, +), (R,-), (R, -), (C, +), (C, -), and (C, —) are all examples 
of groupoid. 

The first and the second projections from G x G to G are distinct binary operations 
on G provided that G is different from singleton. 

Union, intersection, relative compliment, and symmetric difference are all binary 
operations on the power set of a set Y. 

If G is a finite set, then a binary operation on G can be defined by a table called 
multiplication table. Thus, the following table defines a binary operationon G = 
{a, b, c, d}. 


ola\b\c|d 
a\bla\di\c 
biaja\c|d 
c|b\b\cja 
d\c|d\b\c 


The binary operation o is evident from the table. For example, boc = candcob = b. 
Recall that a binary operation o on G is said to be an associative operation if 


(aob)oc = ao(boc) forall a,b,c € G, 
and it is said to be a commutative operation if 
aob = boa foralla,beG. 


The usual addition ‘+’ and multiplication ‘.’ on N, Z, Q, R and C are all asso- 
ciative as well as commutative operations. However, the subtraction — is neither 
associative nor commutative. The projection maps on G x G are associative but not 
commutative unless G is singleton. The operation 0 on N defined by aob = a* + b* 
is commutative but not associative. The binary operations @ and * on Z,, (see Propo- 
sition 3.5.2) are associative as well as commutative. The U, (), Aare all associative 
as well as commutative on the power set of a set. 


Definition 4.1.2 A pair (G, 0), where G is a set and o is an associative binary 
operation on G, is called a semigroup. If the binary operation o is also commutative, 
then we say that it is a commutative semigroup. 
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The pairs (N,+), (N,-), +), (-), Q+), @) @+4, BB), 
(C,+), (C,-), (Zn, ®), and (Z,,,*) are all commutative semigroups. The pair 
(G, p;), where p, is the first projection, is a noncommutative semigroup. The pair 
(Z, —) is not a semigroup. 

Let X be a set having more than 2 elements and G = R(X) be the set of all 
relations on X. Then, the pair (G, 0), where o is the composition of relations on X, 
is anoncommutative semigroup. Similarly, if we denote by F(X) the set of all maps 
from X to X, then (F(X), 0) is anoncommutative semigroup. 


Definition 4.1.3 Let (G, 0) be a groupoid. An element e € G is called a left(right) 
identity of (G, 0) if eoa = a(aoe = a) foralla € G. If e is left as well as right 
identity, then we say that it is both-sided identity. 


0 is both-sided identity of (Z,+), and 1 is both-sided identity of (Z, -). 0 is 
both-sided identity of (Z,,, ®), and 1 is both-sided identity of (Z,,, *). The empty 
set % is both-sided identity of (P(Y), LU) and also of (P(Y), A). Y is both-sided 
identity of (P(Y), ()). The semigroup (N, +) has no identity. 0 is the right identity 
of the groupoid (Z, —) which has no left identity. Every element of G is a right 
identity of (G, p1), where p, is the first projection on G x G. Similarly, in (G, pz), 
every element is left identity but no element is right identity. However, we have the 
following: 


Proposition 4.1.4 Let (G, 0) be a groupoid. Let e, be a left identity and e, a right 
identity of (G, 0). Then e; = e2 and it is both-sided identity. 


Proof Since e> is a right identity and e, is a left identity,e,; = e,0e, = e. tt 


In case a groupoid (G, o) has both-sided identity, it is unique, and we call it the 
identity. 


Definition 4.1.5 A semigroup with identity is also called a monoid. 


Definition 4.1.6 Let (G, 0) be a groupoid. Let e be a left(right) identity of (G, 0). 
Let a € G. Anelement a’ € G is called a left(right) inverse of a with respect to e 
ifa’oa = e(aoad’ = e). If e is the identity of (G, 0), then a’ is called both-sided 
inverse of a if 

aoa’ =e = doa 


Example 4.1.7 If (G, 0) is a groupoid and e is a left(right) identity of (G, 0), then e 
is left(right) inverse of e with respect to e. In (N, -), 1 is the identity and | is the only 
element which has inverse (both-sided). In (Z, -), 1 is the identity, and the inverse 
of | is | and that of —1 is —-1. No other element has inverse (verify). In (Z, +), 0 
is the identity element, and —a is the both-sided inverse of a. In (Z,,, ®), 0 is the 
identity and —a is the inverse (both-sided) of @. In (Z,»,, *), 1 is the identity and @ 
has a inverse b if and only if (a,m) ~ 1 (Corollary 3.5.5). In (P(Y), A), @ is the 
identity and inverse of every element A € P(Y) is A itself (verify). In (Z, —), 0 is 
a right identity and every element of Z is right inverse of itself with respect to 0. 
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Left inverse and right inverse of an element in a groupoid with identity may be 
different. For example, consider the groupoid (G, 0), where G = {e,a, b,c} ando 
is defined by the following multiplication table. 


b 
b 
e 
a 
c 


O 
e 
a 
b 
c 


e 
e 
a 
b 
c 


SV] SPS ALS 


a 
a 
c 
b 
e 


Here, e is the identity, aob = e = coa. Thus, b is a right inverse, and c is a left 
inverse of a. However, the following proposition says that this can not happen in a 
semigroup. 


Proposition 4.1.8 Let (G, 0) be a semigroup with the identity e. Leta € G. Let a’ 


be a left inverse and a’ a right inverse of a. Thea’ = a". 
Proof Under the hypothesis of the proposition, a’ = a'oe =a'o(aoa") = (a'oa)oa" 
=eoa" =a". tt 


The above proposition says that both-sided inverse of an element a in a semigroup 
with identity, if exists, is unique. The inverse of an element a in a semigroup (G, 0) 
with identity is usually denoted by a~'. Thus, 


1 


a 'oa = e = aoa~ 


If we denote the binary operation additively by +, which we usually do when the 
binary operation is commutative, then the identity is denoted by 0 and the inverse of 
an element a is denoted by —a. Thus, 


-a+t+az=0=a+4-a 


Definition 4.1.9 Let (G, 0) be a semigroup. Let a, b € G. An equationaoX = b 
(Xoa = b) is said to be solvable if there exists c € G (called a solution of the 
equation) such that aoc = b(coa = b). 


Theorem 4.1.10 Let (G, 0) be a semigroup, where G is anonempty set. The follow- 
ing conditions on (G, 0) are equivalent. 


I. Equations aoX = band Xoa = bare solvable for all a,b € G. 
2. (G,o) has a left identity e such that every element of G has a left inverse with 
respect to @, 1.e. 


(i) there exist e € G such that 


eoa = aforallaeéG, 
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and 
(ii) foralla € G, there exists a’ € G such that 
1g 
aod = e. 


3. (G, 0) has a right identity e such that every element of G has a right inverse with 
respect to @, i.e. 


(i) there exists e € G such that 
aoe = aforallaeG, 

and 

(ii) foralla € G, there exists a’ € G such that 

aoa’ = e. 
4. (G, 0) has the identity and every element of G has the inverse, i.e. 
(i) there exists e € G such that 
eoa = a = aoe forallaeG, 


and 
(ii) forall a € G, there exists a“! € G such that 


a'oa = e = aoa”. 


5. Equations aoX = band Xoa = b have unique solutions in G for alla, b € G. 


Proof (1 => 2). Assume 1. Since G # @, there is an element a € G. By (1), the 
equation Xoa = a is solvable. Let e be a solution of Xoa = a. Then, 


eoa =a (4.1.1) 
We show that e is a left identity. Let b € G. By (1), the equationaoX = bhasa 
solution c (say). Then, 


aoc = b (4.1.2) 


Now, using 4.1.1 and 4.1.2, and the associativity of 0, 
eob = eo(aoc) = (eoa)oc = aoc = b 
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This shows that e is a left identity. Let a € G. A solution of the equation Xoa = e 
gives an element a’ such that 
aoa =e 


This proves that 1 ==> 2. Similarly, | => 3. 

(2 ==> 4). Assume 2. Let e be a left identity and a’ a left inverse of a with respect 
to e. We show that ¢ is also a right identity and a’ is also a right inverse of a. We first 
show that 

aoa’ =e 
Since every element of G has a left inverse with respect to e, for a’ there a” €G 
such that 

aoa’ =e (4.1.3) 


Now, using the Eq. 4.1.3 and the associativity of the binary operation, 


aoa’ = eo(aoa’) = (a"oa')o(aoa') = a"o(a'o(aoa’)) = a’o((a'oa)oa') = 
a”o(eoa') = aod = e. 
Thus, 
aoa’ =e (4.1.4) 


In turn, using 4.1.4, 

aoe = ao(a'oa) = (aoa')oa = eoa = a 
Thus, e is the identity and a’ is the inverse of a, which we denote by a~'. This proves 
2=—> 4. 

The proof of 3 => 4 is similar to that of 2 => 4. 

(4 => 5). Assume 4. Then, a~!ob is a solution of the equation aoX = b, and 
boa! isa solution of Xoa = b. Further, if c and d are solutions of aaX = b, then 
aoc = b = aod. But, thenc = (a~'oa)oc = a~!o(aoc) = a7'o(aod) = 
(a~'!oa)od = eod = d.Thus,aoX = bhasa unique solution. Similarly, 
Xoa = bhas a unique solution. 

5 => | is evident. tt 


Definition 4.1.11 A semigroup (G, 0) satisfying any one (and hence all) of the 
above 5 equivalent conditions in the theorem is called a group. A group (G, 0) is 
said to be an abelian group (after the name of Abel) or a commutative group if the 
operation o is commutative. 


In an abelian group, a binary operation is usually denoted by + called the addition, 
the identity is denoted by 0 called 0, and the inverse of an element a is denoted by 
—a called the negative of a. 


Example 4.1.12 (Z,+), (Q, +), (R, +), (C, +) are all infinite abelian groups. 0 
is the identity and the inverse of a is —a. 
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Example 4.1.13 The usual multiplication - in Q, R, and C induces multiplications 
inQ* = Q-{0}, R* = R—{0}, C* = C-— {0} with respect to which all of these 
-1 


are abelian groups. | is the identity, and the inverse of an element a isa~* = 1. 


Definition 4.1.14 If a group (G, o) is finite, then the number of elements in G is 
called the order of the group G and is denoted by | G | or 0(G). 


Example 4.1.15 The multiplication - in Z induces multiplication in {1, —1} with 
respect to which it is a finite abelian group of order 2. 


Example 4.1.16 Let m be a positive integer. It follows from Theorem 3.5.2 (i)—(iv) 
(Chap. 3) that (Z,,, @) is a finite abelian group of order m. 0 is the identity, and the 


inverse of @ is —a. 


Example 4.1.17 Let m be a positive integer. Let 
Un = {ae Zm | (a,m) ~ 1} 


Note that if (a,m) ~ l anda = bin Z,, then (b,m) ~ 1. Leta € U,, and 
b € U,. Then, (a,m) ~ 1 and (b,m) ~ 1. By Corollary 3.4.10, (ab,m) ~ 1. 
Hence, @*b = ab belongs to U,,. Thus, * induces a binary operation on U,,, which 
we again denote by x. It again follows from Theorem 3.5.2 ((vi), (vii), and (x)) 
(Chap. 3) that (U,,, *) is a commutative semigroup with identity. Further, it follows 
from Corollary 3.5.5 that if (a,m) ~ 1, then there exists an integer b such that 
(bm) ~ 1 and ab = 1(mod m). In other words, there is a b € U,, such that 
a«b = 1 = bx«da. Thus, every element of (U;,, x) has the inverse, and so it is a 
finite abelian group. The group (U,,,, *) is called the group of prime residue classes 
modulo m. 


Definition 4.1.18 The function ¢ from N to N given by é(1) = 1, andforn > 1, 
o(n) = the number of positive integers less than n and co-prime to n is called the 
Euler‘s phi function or Euler’s totient function. 


Thus, the order of the group U,,, of prime residue classes modulo m is p(m). 
Formula for ¢ (m) 


Let p be a prime number and r > 1. The set of positive integers less than or equal 
to p’ and not co-prime to p’” is {p,2p,3p,..., p’! - p}. Thus, the number of 
positive integers less than p” and co-prime to p" is p” — p’~!. This means that 
o(p’) = p’ — p’—'.In particular, d(p) = p — 1 for all prime p. It will be shown 
later in Chap. 7 that the function ¢ is multiplicative in the sense that if (m,n) = 1, 
then d(mn) = o(m)d(n). Thus, if 


ry fo 


rk 
mM = Py Py ---Pr;s 
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where p1, p2,..., px are distinct primes, then 
om) = (p" — p™')(p? — p?")...(p — p®™"). 


For example, (100) = (52) - (22) = (52 — 5)-(22 — 2) = 40. 


Example 4.1.19 Let S' = {z € C || z|= 1} denote the unit circle in the 
complex plane. The multiplication of complex numbers induces multiplication in S! 
with respect to which it is a group, called the circle group or the torus group of 
dimension 1. This is an infinite abelian group. 


Example 4.1.20 Let P denote the set of roots of unity. Thus, P {z € C | z” = 
1 for some n € Z}. The multiplication of complex numbers induces a multiplication 
in P with respect to which it is a group (verify). This is also an infinite abelian group. 


Example 4.1.21 Letn € N. Let P, denote the set of nth roots of unity. Thus, P, = 
{en | 0 < r <_ n}. The multiplication of complex numbers again induces a 
multiplication in P,, with respect to which it is an abelian group of order n (verify). 


Example 4.1.22 (Klein’s four group). Let V; = {e, a, b,c}. Define a binary oper- 
ation o on V4 by the following table: 


QLs ora eo 
RIQLT Saya 


QD] SARL eR] 


a 
a 
e 
Cc 
b 


e 
e 
a 
b 
c 


It can be checked that o is associative. Clearly, e is the identity, and every element is 
its own inverse. Thus, (V4, 0) is a group called the Klein’s four group (after the name 
of the great geometer Felix Klein). This group is a finite abelian group containing 
four elements in which every element is its own inverse. 


So far, we had examples of abelian groups only. Following few examples are those 
of nonabelian groups. 


Example 4.1.23 Let Qg = {1,—1,i, j,k, —i, —j, —k}. Define a multiplication - 
in Qs as follows: | acts as the identity of the operation, multiplication by -1 changes 
sign, v= f = k* = (-i) = (-j)? = (—k)? = —1,and the multiplication 
between i, j,k, —i, —j, —k is obtained by treating them as unit vectors along the 
three axises and taking vector product between them. Thus, the multiplication table 
for binary operation - is given by 
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(Qs, -) is a group of order 8 (verify) called the Quaternion group of degree 8 or 
the Hamiltonian group (after the name of Hamilton). This group is nonabelian 
(i- j # j -i) group of order 8. 


Example 4.1.24 Transformation group or symmetric group. Let X be anonempty 
set. A bijective map from X to X is called a permutation on X. Let Sym(X) 
denote the set of all permutations on X. Then, Sym(X) is a group with respect to 
composition of maps. This group is called the transformation group on X or the 
symmetric group on X or the permutation group on X. If X = {1,2,...,n}, 
then Sym(X) is denoted by S, and called the symmetric group of degree n or the 
permutation group of degree n. The number of bijective maps from X to X in 
this case is n! (prove it by induction). If X contains just one element, then Sym(X) 
also contains one element, and hence, it is abelian. If X contains two elements, then 
Sym(X) contains 2! = 2 elements of which one is the identity element, and hence, 
in this case also, Sym(X) is abelian. Suppose that X contains at least three elements. 
Let a,b,c be distinct elements of X. Define a map f from X to X as follows: 
f(a) = b, f(b) = c, f(c) = a,and f(x) = x whenever x ¢ {a, b, c}. Clearly, 
f € Sym(X). Let g be another map from X to X given by g(a) = b, g(b) = a, 
and g(x) = x whenever x ¢ {a,b}. Then, (gof)(a) = aand(fog)(a) = c. 
Hence, gof # fog. Thus, in this case, Sym(X) is nonabelian. 


Remark 4.1.25. There are two main resources of groups, viz. the symmetric groups 
and the matrix groups. The symmetric group will be studied in detail in Sects. 6.1 
and 6.2 of Chap. 6. The matrix groups will be introduced and studied in the Sects. 6.3 
and 6.4 of the Chap. 6. 


Example 4.1.26 (Group of isometries or group of motions). Consider the Euclid- 
ean space R*. The distance d(x, y) between two points x = (x), x2,x3) and 
y = (1, y2, y3) of R?3 is given by 


d(x,y) = +7 (1 — yi)? + Ga — yo)? + (3 — ys)? 
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A bijective map! f from R? to R? is called an isometry also called a rigid motion 
if d(f(x), f(y)) = d(x, y) for all x, y € R*. Let Iso(R*) denote the set of all 
isometries of R*. Then, /so(R?) is a group with respect to composition of maps, and 
it is called the isometry group of R° or the group of rigid motions. 


Example 4.1.27 Group of symmetries of geometric objects. Let X C R°? (e.g., X 
may be a regular tetrahedron or X may be an ellipsoid, sphere, or any object). Let 


Tsox(R*) = {f € Iso(R%) | f(X) = X} € Tso(R*) 


Then, Jsox(R*) is a group with respect to composition of maps. This group is called 
the group of symmetries of X. This group measures the symmetry of X. More 
symmetrical is the geometrical object X, larger is the group of isometries of X. 


Example 4.1.28 Group of symmetries of plane geometric figures. Consider the 
group Jso(R?) of isometries of R?. Thus, Jso(R7) is the set of all bijective (the 
condition of being bijective is redundant) distance preserving maps from R? to R?, 
and it is a group under composition of maps. There are three types of fundamental 
isometries of R*: (i) rotations about different points, (ii) reflections about different 
lines in R?, and (iii) translations (write down transformations representing these 
isometries). It is a fact (prove it or see algebra 2) that every isometry of R? is 
obtained by composing these fundamental isometries. In fact, an isometry of R? is 
either a rotation or a reflection or a translation or a composition of a reflection and a 
translation. 

Let X be a bounded subset of R? (e.g., a circle, an ellipse, a triangle, or a polygon). 
Then, translation or composition of translations and reflections can not leave X 
invariant. Thus, only rotations and reflections may belong to the group Jsox (R7) of 
isometries of X. Let us describe the group Jsox (R*), where X is a regular polygon 
of n sides, n > 3. Since X is bounded, only rotations and reflections can belong to 
Tsox(R?). Suppose that n is even. The rotations through angles 2EL O<r<n 
about the center of the polygon are the only rotations which keep X invariant. The 
reflections which keep X invariant are reflections in lines joining opposite vertices 
of the regular polygon and also reflections in the line joining middle points of the 
opposite sides. These are also n in number. Thus, sox (IR?) contains exactly 2n 
elements. 

Next, suppose that n is odd. Again, rotations through angles 2a O<r<n 
about the center are the only rotations which keep X invariant. The reflections in lines 
joining vertices with middle points of their opposite edges are the only reflections 
which keep X invariant. Thus, again Jsox (R) contains 2n elements. This group (in 
both cases) is called the dihedral group and is denoted by D,,. The structure of this 
group will be studied later. 


'The condition of f being bijective is redundant. In fact, if a map f from R? to R? satisfies the 
condition d( f(x), f(v)) = d(x, y) forall x, y € R3, then as a consequence f is bijective. 
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Exercises 


4.1.1 Let G be a set containing n elements. Find the number of binary operations 
on G. How many of them are commutative? 


4.1.2 Define a binary operation o on Z by xoy =| x — y |. Show that o is commu- 
tative but not associative. 


4.1.3. Define a binary operation o on Z by xoy = x + y—x-y, where + and - are 
usual addition and multiplication in Z. Show that (Z, o) is a semigroup. Is it a group? 


4.1.4 Let Y be a set. Let G be the set of all maps from Y to Y. Then, (G, 0) isa 
semigroup with identity. Let f be a surjective map on Y. Let g € G. Show that the 
equation foX = g is solvable, where X is unknown in the equation. If f € G is 
injective, then show that Xof = g is solvable. Is (G, 0) a group? 


4.1.5 Let (G, 0) be a group. Define another binary operation o’ on G by 

xo'y = y loxoy” 
Show that (G, o’) is a groupoid in which equations Xo’x = yandxo'X = y have 
unique solutions for all x, y € G. Show that if o’ is commutative, then o' = o. 


Show that (G, o’) may be a group even if (G, 0) is nonabelian. Find a necessary and 
sufficient condition so that it becomes a group. 


4.1.6 Describe the group of symmetry of an isosceles triangle. 
4.1.7 Describe the group of symmetry of a square. 


4.1.8 Describe the group of symmetry of a rectangle which is not a square and 
observe that square is more symmetrical. 


4.1.9 Describe the group Jsox (IR*), where X is a regular tetrahedron with origin as 
centroid. How many elements in this group are there? 


4.1.10 Show that Isos:(R?) = {fg : R*? —> R?|0€R} U) {po : R? —> R? | 
@ € R}, where fy is the map defined by 


fox, y)) = (x cos6 + ysin@ —x sin? + ycos®@) 
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and ppg is the map defined by 

po((x, y)) = (x cosO + ysin@ —x sin? — ycosé@) 
4.1.11 Describe the group of symmetries of the ellipse 


x y 
4 9 


4.1.12 Describe the group of symmetries of Z. 


4.1.13 Describe the group of symmetries of the unit sphere in R* with center as 
origin. 


4.1.14 Let (G, 0) be a group and X a nonempty set. Let G* denote the set of all 
maps from X to G. Define a binary operation « on G* by (f *g)(x) = f(x)og(x) 
for all x € X. Show that (G*, «) is a group. Show that it is commutative if and only 
is G is commutative. 


4.1.15 Let (G,, 0;) and (G2, 02) be groups. Define a binary operation * on G; x G2 
by (a,b) x (c,d) = (ao,c, bo2d). Show that (G; x G2, *) is a group called the 
external direct product of G, and Go. 


4.1.16 LetG = Q — {I}. Define a binary operation 0 on G by aob = a+b—ab, 
where + and - are usual addition and multiplications in Q. Show that (G, 0) is a 
group. What is the identity and what is the inverse of an element a in G? 


4.1.17 Let (G, o) be an abelian group and c € G. Define a binary operation * on G 
by ax b = (aob)oc™'. Show that (G, *) is a group. What is its identity and what 
is the inverse of an element a € G? What happens if we drop the condition of the 
group being abelian? 


4.1.18 Let (G, 0) be a group and f a bijective map from X to G. Define a binary 
operation * on X by the requirement that f(x* y) = f(x)of(y). Show that (X, «) 
is a group. 


4.1.19 Let X be a nonempty finite set. Show that we can always define a binary 
operation on X so that it becomes a group. 
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4.1.20 Show that (P(X), A) is an abelian group in which every element is its own 
inverse. 


4.1.21* Let X be an infinite set and Y the set of finite subsets of X. Show that there 
is a bijective map from X to Y. Show that (Y, A) is an abelian group. Use it to show 
that every nonempty set can be given an abelian group structure. Deduce that there 
is no set containing all abelian groups. 


4.1.22 Let p be a positive prime. Let G, denote the set of all complex numbers 
which are p power roots of unity. Show that the multiplication of complex numbers 
induces a multiplication in G, with respect to which it is an infinite abelian group. 


4.1.23 Examine whether the set (ie | m,n € Z} of rational numbers form a 
group with respect to the multiplication of rational numbers. 


4.1.24 Show that the set {7 | m,n € Z,n 4 Oand (n, p) ~ 1}, where p isa given 
prime, is a group with respect to the usual addition of rational numbers. 


4.1.25 Let (G, 0) be a group. Define a binary operation « on G by 
Xey= yoxoy? 


Show that (G, *) is a group if and only if every element of the form y~?0 (x0 y)?0x~? 


commutes with each element of G. 


4.1.26 Let 
S= {dag + ayi + dj + a3k | ao, ay, a2, a3 €Randa+ai+a5t+a; = |} 
This set is called the set of unit Quaternions (observe that this set can also be identified 
with the unit sphere in the Minkowski space R*). Define a multiplication * in S? as 
follows: 
(do + at + ayj + agk) ® (bo + byt + boj + b3k) = cot cyi + crf +03k 
where co = dgbo = a,b, = anb2 = a3b3, C= dob + abo + ab; = a3bp, Q = 


dob2 + darby — ayb3 +.a3b; andc3 = agb3 + a3bo + ayb2 — azb,. Show that (S?, x) 
is a nonabelian group (compare this group with circle group). 


4.1.27 Find the inverse of 7 in Ujo. 
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4.1.28 Find the inverse of 10 in U>,. 


4.1.29 Find the inverse of 250 in Uoay. 


4.1.30 Find the order of U¢4o. 


4.2 Properties of Groups 


As already observed, identity of a group is unique. The inverse of an element a in 
a group (G, 0) is unique and is denoted by a~!. If the binary operation is written 
(which we usually do when the operation is commutative) additively, then the identity 
is denoted by 0 (called zero) and the inverse of an element a is denoted by —a (called 
the negative of a). 


Proposition 4.2.1 Jn a group (G, 0), (a~')~! = aand(aob)~! = b~'oa™! for 
alla,beG. 


Proof Since the inverse of an element in a group is unique, it is sufficient to observe 


1 —-1 


a oad = e = aoa 
and 
(b-'oa™!)o(aob) = e = (aob)o(b~!oa~') 
The second observation follows from the associativity of the binary operation. 


Proposition 4.2.2 Cancelation law holds in a group (G, 0) in the sense that 


(aob = aoc) => b 


ll 
fay 


and 
(boa = coa)=>b=c 


Proof Suppose that aob = aoc. Then, 
b = eob = (a'oa)ob = a~‘o(aob) = a~'o(aoc) = (a~'oa)oc = eoc = c¢ 


Second part follows similarly. tt 


Proposition 4.2.3 Let (G, 0) be a finite semigroup in which cancelation law holds. 
Suppose that G & Y. Then, (G, 0) is a group. 
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Proof Itis sufficient to show that equations aoX = band Xoa = b have solutions 
in G for all a,b € G. Leta € G. Define a map L, from G to G (called the left 
multiplication by a) by La(g) = aog. Suppose that La(g;) = La(g2). Then, 
aog, = 40g2. Since (G, 0) satisfies cancelation law, gj = 4g. Hence, Ly is 
injective. Since G is finite, L, is also surjective. Thus, given an element b in G, there 
is an element c in G such that L,(c) = b. This means that aoc = b. Similarly, 
considering the right multiplication R, by a, we can show that equations Xoa = b 
is solvable for all a, b € G. tt 


Remark 4.2.4 (N, +) is an infinite semigroup in which cancelation law holds, but 
(N, +) is not a group. Thus, the finiteness condition in the above proposition is 
essential. 


Remark 4.2.5 The associativity of a binary operation o in a groupoid implies that 
product of a finite sequence a), a2, ..., a, taken in same order is independent of the 
manner in which we put parenthesis. If in addition it is commutative, then the product 
is independent of the order also. A precise proof of the above assertion follows by 
induction on n. 


Integral Powers of Elements of a Group 


Let (G, 0) be a group and a € G. We first define nonnegative integral powers of a. 
This we do by induction. Define a® = e, the identity of the group. Assuming that 
a” has already been defined, define a"*! = q"oa. If n is negative, then a~” has 
already been defined, and then, define a” = (a —")-l| Thus, all integral powers of 
a have been defined. Clearly, forn > 0, 


aq = aodo:::oa 
net! 


and 


Proposition 4.2.6 (Law of Exponents) Let (G, 0) be a group anda € G. Then, 


(i) qiutm — a"oa™ 
(ii) (a")" = a” foralln,m € Z. 


Proof The proof for n > 0, m > O follows by induction on m and is left as an 


exercise. If n < 0, m < 0, then from the previous case, a = a "oa", or 
a"+™ = a-"oa—", Taking the inverses, 
quem = (a my lola m) i: = a"oa™ 


Next, suppose thatn > Oandm < 0.Then,n+m > Oorn-+m < 0. Suppose first that 


n-+m > 0. Then, from what we have just proved, it follows that a"t"oa~" = a", 
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and hence, a"t" = a"oa™. The case n + m < 0 follows similarly. This completes 


the proof of (i). Similarly, by induction on m, we can prove the (ii). tt 


Remark 4.2.7 Let (G, 0) be a group and a € G. Then, 


(i) aoa” = qutm = qutn = a” oa" 
(ii) (a")™ =qrm — qnn — (a”)" 


Thus, integral powers of an element commute with each other. 


Illustrations 

2.1. Let (G, 0) be a group and a € G. Suppose thata” = e = a", mAO0Fn. 
Then a4 = e, where d is g.c.d of m and n. In particular, if m and n are co-prime, 
thena =e. 


Proof By the Euclidean algorithm, there exist u,v € Zsuch thatd = um + vn. 
Hence, using the law of exponents, we get that 


até = quintun = a" oq?" = (a”)"o(a")” = eoe” =e. tt 


2.2. Let (G, o) be a group in which every element is its own inverse or equivalently 
a’ = e forall a € G. Then, (G, o) is abelian. 


Proof aob = (aob)~' (by the hypothesis) = b~'oa~! (by Proposition 4.2.1) = boa 
(by the hypothesis) for all a, b € G. This proves that (G, 0) is abelian. tt 


2.3. Let (G, 0) be a group such that (aob)? = a*ob* for all a,b € G. Then, 
(G, o) is abelian. 


Proof aoaobob = a?ob* = (aob)? = aoboaob Va,b € G. By the cancelation 
law, aob = boaVa,beG. tt 


Remark 4.2.8 1. A group in which a* = e for all members a of the group may 
not be abelian. 


2. A group G in which (aob)?_ = aob? for all a,b € G need not be abelian. 
In fact, for each n > 3, there is a nonabelian group in which a” = e (and so 
(aob)" = a”ob” for alla, b) for all a in the group. Example to support this will 
be given later. 


2.4. Let (G, o) be a group and n an integer such that (aob)” = a™ob" for all 
m €{n,n+1,n+ 2}, and alla, b € G. Then, (G, o) is an abelian group. 


Proof aobo(aob)" = (aob)"t'! = a®*'ob"*!, Since (aob)" = a”ob", we have 
aoboa"ob" = a"*'ob"*", By the cancelation law, we get 


boa" = a"ob (4.2.1) 
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for alla, b € G. Using the same argument by putting n + | at the place of n, we get 
boa"! = a"*'ob (4.2.2) 
for all a, b € G. Further, using the above two equations, we get 
a"*!ob = boat! = boa"oa = a"oboa 


for all a, b € G. Canceling a” from left, we get that aob = boa for alla,b€G.f 


2.5. Let (G, o) be a finite group containing even number of elements. Then, there 
isaanelementa € G,a # e such that a! = a(ie.,a? = e). Further, odd number 
of such elements exists. 


Proof Define a relation ~ on G by 
a ® bifand only ifa = bora = b™' 


It is easily seen that ~ is an equivalence relation. The equivalence class determined 
by a is {a,a~'}. It is singleton, if a = a', and otherwise, it is doubleton. Let 
S be the union of all those equivalence classes which are doubletons. Then, S is 
disjoint union of doubletons, and hence, it contains even number of elements 2m 
(say). Clearly, e ¢ S. Thus, S fe} contains 2m + 1 elements. Since G contains 
even number of elements, G — (SUf{e}) = {ae G|af#eanda' = ajis 
nonempty and contains odd number of elements. tt 


2.6. Converse of 2.5 is also true: Let (G, 0) be a finite group in which there is an 
element a 4 e such thata = a7! (a? = e). Then, G contains even number of 
elements. 


Proof Leta € G,a #eand a* = e. Define a relation ~ on G by 
x © yifand only ifx = yorx = ay 


It is easy to see that this is an equivalence relation and the equivalence class deter- 
mined by x is {x, ax}. Clearly, x # ax, and so each equivalence class contains 2 
elements. Since G is union of disjoint equivalence classes, it contains 2n elements 
for some n. tt 


2.7. Let (G, o) be a group and | G |< 4. Then, G is abelian. 


Proof If| G |= 1,thenG = {fe}, and nothing to do. If | G | = 2, then it contains 
two elements of which one is identity, and again, there is nothing to do. Suppose that 


|G |= 3.Then,G = {e,a,b}, where e is the identity element and a and b are 
distinct nonidentity elements of the group. We need to show that aob = boa. Now, 
aob # a, for otherwise b = e. Similarly, aob 4 b, for otherwise a = e. Thus, 


aob = e. Similarly, boa = e. Hence, in this case, also G is abelian. 
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Next, suppose that | G | = 4.LetG = {e,a,b,c}, where e is the identity of 
G and a, b,c are distinct non identity elements of G. Since G is of even order, by 
illustration 2.5, it contains odd number of nonidentity elements which are their own 
inverses. There are two cases: 


(i) a7! = a, b-'! = bande"! =c 
(ii) Only one of a, b, c is its own inverse. 


In case (i), every element of G is its own inverse and so, by illustration 2.2, G 
is abelian (in fact, in this case, it is left as an exercise to prove that the group is the 
Klein’s four group). Consider the case (ii). Without any loss we can assume that 
a! = aandb"! £b, c“! 4c. Now, ab ¥ e, for otherwise b = a~'! = a. Next, 
ab # a, for otherwise b = e. Also, ab 4 b, for otherwise a = e. Thus,ab = c. 
Consider b*. Clearly, b? 4 e, for otherwise b~' = b. b* # b, for otherwise b = e. 
Also b? = c implies that b? = ab. This, in turn, implies thata = b. Thus, be=a 
and b? = ab = c. This shows thatG = {fe = b®, b, b*, b*}, and so it is abelian 
(see Remark 2.3). tt 


An Application to Number Theory 


Theorem 4.2.9 (Wilson Theorem). Let p be a positive prime. Then 
(p—! + 1 =O0(nod p) 


Equivalently (p — 1)! — (p — 1) is divisible by p (if we divide (p — 1)! by p the 
remainder is p — 1). 


Proof Consider the group (U,, x) of prime residue classes modulo p. This is an 


abelian group. Leta ¢ U,, 1 < a < p—1 be its own inverse. Then, @2=a@ =. 
This is equivalent to say that p divides a? —1 = (a—1)(a+1). Since p is prime, p 
divides a — 1 or p dividesa+1.Sincel1 <a < p—1, a = lora = p—1.Thus, 
1 and p — 1 are the only elements in U p Which are their own inverses. If we take the 
product of all elements in U, (observe that the order will not matter for the group is 
commutative), the elements which are not their own inverses will cancel with their 
inverses, and we get, 


Thus, (p— 1)! = p—1 = —1. Equivalently, (p —1)!+1 = 0 in U,. This 
means that 
(p — 1)! + 1 =O(mod p). tt 


Converse of the Wilson, the theorem is also true: 


Theorem 4.2.10 Letn > 1. Suppose that (n — 1)! + 1 = O(mod n). Then n is 
prime. 
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Proof Suppose that n is not prime. Then,n = r-s, where 1 < r < n. Clearly, r 
divides (n—1)!. Since n is supposed to divide (n—1)! + 1, r also divides (n—1)! + 1. 
This is impossible. tt 
As an application, we find that | is the remainder obtained when 28! x 30 + 2 
is divided by 29. For, in Z.9, 28! x 30 + 2 = 28!*%306 2 = -1l*1@2 = 1 
(by Wilson theorem, 28! = —1). _ = 
Similarly, when we divide 27! by 29, the remainder is 1. For, —1 = 28! = 
27! «28 = —(27!) (observe that 28 = —1). 
Exercises 


4.2.1 Show that a group (G, 0) is abelian if and only if (aob)~! = a~'ob™' forall 
a,beG. 


4.2.2 Give an example of a semigroup in which left cancelation law holds but right 
cancelation does not hold. 


4.2.3 In the group (Z, +), show that a” isn -a for alln € Z anda € Z, where - 
denotes the usual product of integers. In particular, 1" = n forall n € Z (observe 
that the power is taken with respect to +). 


4.2.4 Let (G, 0) be a group. Suppose that a” = a” = a" = e.Showthata’ = e, 
where d is g.c.d of m,n, andr. 


4.2.5 Let (G, 0) be a group and a, b € G. Show that (boaob™!)" = boa"ob™ for 
alln e€ Z. 


4.2.6 Let (G, 0) be a group and a, b € G. Suppose that boaob~! = a’. Show that 
(i) D"™oaob-™ = a”) 
(ii) b"™ oatob-™ = qt 


for all integers m, r, and ft. 


3 


> 


4.2.7 Let (G, 0) be a group. Let a,b € G such that a> = e = 
boaob-! = a*. Show thata = e. 


. Suppose that 


4.2.8 Let (G, 0) be a group. Find solutions of equations 


(i) aoXob? = b 


(ii) aoXoa* = b 
(iii) aoXoaoX = box 
4.2.9 Show that in a group the equation X= e has even number of solutions. 


Show that there may be more than two solutions. 
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4.2.10 Do we always have solutions of equation XoaoX = bina group (G, 0)? 
Support. 


4.2.11 Show that every group of order 5 is abelian. 


4.2.12 Let X be a finite set. Let G be a nonempty subset of Sym(X) such that 
fog € G forall f,g € G. Show that G is a group with respect to composition of 
maps. 


4.2.13 Show that if p is prime, then (p — 2)! — 1 is divisible by p. 
4.2.14 Find the remainder when 18! x 17! + 3 is divided by 19. 
4.2.15 Find the remainder when 100! is divided by 101. 


4.2.16 Let (G, 0) be a finite group containing n elements. Let a € G. Show that 
there exists m, 1 <m <nsuchthata” = e. 


4.2.17 Let p be a prime number. Let a be a nonidentity element of a group such 
thata? = e.Letie N, 1 <i < p.Show thata' ¥ e. Deduce that in this case if 
G is finite, then the number of elements in G is a multiple of p. 

Hint. Define a relation ¥ on G by x * y => x = a'y forsomei,0 <i < p. 
Observe that it is an equivalence relation. Look at the equivalence classes. 


4.2.18 Let (G, o) bea finite group of odd order such that 3 does not divide the order 
of the group. Suppose that (aob)* = a?ob? for all a, b € G. Show that G is abelian. 
Hint. Show that the maps a ~» a? anda ~~ a? are injective, and so surjective. Further, 
use cancelation law to show that a20b? = b3o0a* for alla,beG. 


4.2.19 Show by means of an example that the conclusion of Exercise 4.2.17 need 
not hold if p is not assumed to be prime. 


4.2.20 Suppose that m is the smallest positive integer such that a” = e (sucham 
if exists is called the order of a). Show that the conclusion of Exercise 4.2.17 holds 
good if p is replaced by m (observe by means of an example that such a number m 
need not be prime). 


4.2.21 Let (G, 0) be a finite group of odd order. Suppose further that (aob)? = 
aob? and (aob)? = b> oa? foralla, b € G. Show that (G, 0) is abelian. 


4.2.22 Let (G, 0) be a finite abelian group. Let a be the product of all elements of 
G. Show that a” = e. 
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4.3 Homomorphisms and Isomorphisms 


Whenever we have some mathematical object, the first and the foremost thing to 
study that object is to make it explicit as to when we are going to identify such 
mathematical objects. For example, to study triangles, the place where the triangle is 
situated in the space is immaterial. More precisely, two triangles A; and A) are taken 
to be same if there is a rigid motion o which takes A; to Az. We identify congruent 
triangles. If we are interested only in the shape but not the size of the triangles, then 
we identify similar triangles. In groups, we identify two groups (G1, 0;) and (G2, 02) 
if they are isomorphic in the sense that there is a bijective map f from G; to G2 
which preserves binary operations (i.e., f(ao,b) = f(a)o2 f(b) Va, b € G,). More 
generally, we have the following: 


Definition 4.3.1 Let (G,, 0,) and (G2, 02) be groups. A map f from G, to G2 is 
called a homomorphism if 


f(ao\b) = f(a)o2 f(b) 


for alla, b € G,. 

An injective homomorphism is called a monomorphism. A surjective homomor- 
phism is called an epimorphism. A bijective homomorphism is called an isomor- 
phism. A homomorphism from a group (G, 0) to itself is called an endomorphism. 
An isomorphism from (G, 0) to itself is called an automorphism of the group. 


Remark 4.3.2. Every branch of mathematics, apart from having its applications in 
other branches of knowledge, has some of its own guiding problems. The researches 
in that branch are centered around these problems. Of course, one never dreams of 
solving the problem completely. However, in attempts to solve the problem partially, 
one develops literature and tools in the subject, and this is how the subject develops. 
The main problem in group theory is to classify groups up to isomorphism. For 
example, one may ask: ‘How many nonisomorphic groups of order n are there?’ and 
“what are they?’ Theory of finite groups is centered around this problem. 


Proposition 4.3.3 Let (G,, 01) and (G2, 02) be groups and f a homomorphism 
from G, to G3. Let e, be the identity of (G1, 01) and ep the identity of (G2, 02). Then, 


(i) fle) = e2, 
(ii) fia) = (f(a))"! foralla € G,, 
(iii) f(a") = (f(@))" forall a € G, and n € Z, and 
(iv) f(aj'oiay’o,...o\a7") = f(ay)"02 f (az) 02...02f (a) 
for all ay, dz,...,a, € Gj andnj,n2,...,n,; € Z. 


Proof Since e2, e; are identities in the corresponding groups and f is a homomor- 
phism, 
erorf(ei) = flei) = fleioie:) = flei)orf (er). 
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By the cancelation law in a group, e2 = f (e;). This proves (i). Further, 


f(a"')oo f(a) = f(a oa) = fle) = & = f(a 'of a. 


By the cancelation law, f (a!) = f(a)~!. This proves (ii). The proof of (iii) for 
n > 0 follows by induction and the fact that f is a homomorphism. Suppose that 
n = —m,wherem > 0. Then f(a") = f(a”) = f((a")!) = (f(a™)“!) (by 
(ii) =(f@")"! = f@-™" = f(a". This proves (iii). The proof of (iv) follows 
from (iii) and the induction on r. tt 


Proposition 4.3.4 Composite of any two homomorphisms is a homomorphism. 


Proof Let f be a homomorphism from a group (G1, 0;) to a group (G2, 02) and g 
a homomorphism from (G2, 02) to (G3, 03). Then, 


(gof)(aoib) = g(flaoib)) = g(flajorf(b)) = g(f(a)osg(f(b)) = 
(gof)(a)o3(gof)(b). 


Hence, gof is a homomorphism. ft 


Since the composite of injective maps is injective, the composite of surjective 
maps is surjective, and the composite of bijective maps is bijective, it follows that the 
composite of monomorphisms is a monomorphism, the composite of epimorphisms 
is an epimorphism, and the composite of isomorphisms is an isomorphism. 


Proposition 4.3.5 If f is an isomorphism froma group (Gj, 01) to a group (Ga, 02), 
then fc! is also an isomorphism from (G2, 02) to (G1, 01). 


Proof Since f is bijective, f~' is also bijective. It is sufficient, therefore, to show 
that f—! is also a homomorphism. Since f is a homomorphism, 


ff 'Oaf'@) = ff O)of(f'@) = cord 


for all c, d € Go. Further, since f is bijective 


f~'(Cord) = fr") fd) 
for all c, d € Gp. This shows that f~! is an isomorphism. tt 


It follows from the above propositions that the relation ‘is isomorphic to’ is an 
equivalence relation on any set of groups. The notation G; * Gy will stand to say 
that G; is isomorphic to Go. 


Remark 4.3.6 The counterpart of homomorphisms in the category of metric spaces 
(topological spaces) are continuous maps. The reader may note the difference 
between an algebraic category and a topological category by observing that the 
inverse of a bijective continuous map need not be a continuous map. 
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Example 4.3.7 Let (G;, 01) and (G2, 02) be groups. The constant map from G, to 
G2 which maps each element of G, to the identity e2 of Gz is ahomomorphism. This 
homomorphism is called the trivial homomorphism or zero homomorphism. 


Example 4.3.8 Let (G, 0) be a group. The identity map Jg on G is clearly an iso- 
morphism. Thus, it is also an automorphism of (G, o). 


Example 4.3.9 Let m € Z. The map f,, from Z to Z given by fi,(a) = ma is 
a homomorphism from the additive group (Z, +) to itself(verify). We show that 
every homomorphism from (Z,+) to itself is f,, for some m in Z. Let f be a 
homomorphism from (Z, +) to itself. Suppose that f(1) = m. Then, as observed 
earlier, n is the nth additive power of 1 in the group (Z, +). From Proposition 4.3.3, 
it follows that f (7) is the nth additive power of f(1) = m. Thus, f(n) = nm, and 
so f = fm. Clearly, fi, is the zero homomorphism if m = 0, and it is injective 
if m 4 0. This also shows that a homomorphism from (Z, +) to itself is either zero 
homomorphism or a monomorphism. Further, the map m ~» f,, is a bijective map 
from Z to the set End(Z, +) of all endomorphisms of the additive group (Z, +) of 
integers. Note that f,, is an automorphism if and only ifm = +1. Also, observe 
that End(Z, +) is a semigroup with respect to composition of maps and m ~» fi, is 
an isomorphism from the semigroup (Z, -) to End (Z, +). 


Example 4.3.10 Let (R, +) denote the additive group of real numbers and (R*, -) 
the multiplicative group of positive real numbers. Let a be a positive real number. 
Define a map f, : R —> R* by f(x) = a*. Then, it follows from the law of 
exponents that f, is ahomomorphism. Also, fy is bijective, and the inverse of f; is 
the map /og, which maps y to log, y. Thus, f is an isomorphism, and so the groups 
(R, +) and (R’, -) are isomorphic. 


Example 4.3.11 The map f : R —> S! defined by f(x) = e” is a homomor- 
phism(law of exponents). Further, if z = a+b. € S', thena® + b? = 1. 
Hence, there is an angle 9 € R such thata = cosé,b = sin@. But, then 
z = cosO + usind = e” = f (0). This shows that f is a surjective homomor- 
phism. 


Example 4.3.12 Let (G, 0) be a group anda € G. Define a map f, from Z to G by 
f(n) = a”. Then, from the law of exponents, it follows that f, is ahomomorphism 
with f,(1) = a. This homomorphism, therefore, is completely determined by its 
image on 1. Thus, a ~» f, defines a bijective map from G to the set Hom(Z, G) of 
all homomorphisms from the additive group of integers to the group G. 


Illustrations 


3.1. Any homomorphism from the additive group (Q, +) of rational numbers to itself 
is multiplication by a rational number. In particular, it is a zero homomorphism or it 
is an isomorphism. 


Proof Let f be ahomomorphism from (Q, +) to itself. Let f(1) = r € Q. Then, 
as in Example 4.3.9, f(m) = rm for allm € Z. Let m 4 0 and f(4) = t. Then, 
ifm > 0, 
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1 1 1 
fQ) = fi) + f(—) +--+ f(—) 
m m m 


= 
m 


andifm < 0, 
1 1 1 
FQ) = f(-=) + f(-=) $+ + f(-=) 
m m m 


—_—_—_—_—_—_—_—S ———— 


—m 


This means thatr = f(1) = mt. Intum,f = f(4) = r- +. Hence, 
fG) = n- f(4) =n-> = r-*. Thus, f(s) = r-s foralls ¢ Q If 
r = 0, then f is a zero homomorphism. If r ¥ 0, then it is bijective, and so it is an 
isomorphism. ft 


3.2. Let (G, 0) be a finite group. Then the only homomorphism from (G, 0) to 
(Z, +) (or to (Q, +) or to (R, +)) is the zero homomorphism. In particular, there is 
no nontrivial homomorphism from the additive group Z,, of residue classes modulo 
m to any of the groups (Z, +), (Q, +) or (R, +). 


Proof Let f be a homomorphism from G to Z. Suppose that | G |= n. Leta € G. 


Then there exists m, 1 <m <nsuchthata” = e (see ex 4.2.16). Since f isa 
homomorphism,0 = f(e) = f(a") = m.- f(a). Sincem 40, f(a) = 0. This 
shows that f is a zero homomorphism. tt 


3.3. There is no nontrivial homomorphism from 


(i) (Q, +) to (Z, +) 
(ii) (Q, +) to (Zn, ®) 
(ii) (Q, +) to (Q", -). 


In particular, these pair of groups are not isomorphic. 


Proof Any homomorphism from (Q, +) to (Z, +) can viewed as a homomorphism 
from (Q, +) to (Q, +) such that the image is contained in Z. From the illustration 3.1, 
it follows that any such homomorphism is multiplication by a rational number. The 
result (1) follows if we note that there is no nonzero rational number such that all ratio- 
nal multiples of that rational numbers are integers. For (ii), let f be a homomorphism 
from (Q, +) to (Zm, ®). Letr ¢ Q. Then f(r) = m- f(2) = mu = mu = 0. 
This proves that f is the trivial homomorphism. Further, to prove (iii) let f be a 
homomorphism from (Q, +) to (Q*, -). Let r € Q. Then, since f is a homomor- 
phism, f(r) = f()”. Thus, f(5) = f(r)m € Q for all m € Z. Since there is 
no rational number except | all of whose roots are rational, it follows that f(r) = 1 
forallreQ. tt 


3.4. There is no homomorphism f from (Z, +) to (Q*, -) such that f(2) = t 
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Proof lf f(A) = r,then f(@2) = r= i. Since there is no rational number r 
such that r? = i, it follows that there is no such homomorphism. tt 


Recall that an endomorphism of a group (G, o) is ahomomorphism from (G, 0) 
to itself. End(G, o) denotes the set of all endomorphisms of the group (G, 0). Since 
composition of homomorphisms is homomorphisms, End(G, o) is a semigroup with 
identity with respect to the composition of maps. Further, an automorphism of the 
group (G, 0) is an isomorphism from (G, 0) to itself. Aut(G, o) denotes the set of 
all automorphisms of (G, 0) which is a group (composition of automorphisms is 
automorphisms, inverse of an automorphism is an automorphism, and identity map 
on G is also an automorphism) with respect to composition of maps. We compute 
End(G, 0) and Aut(G, o) for some groups. 

3.5. (1) End(Z, +) & (Z,-) as a semigroup. 

(ii) Aut(Z,+) ~ ({1, —1}, -) (note that {1, —1} is a group with respect to mul- 
tiplication). 


(iii) End(Q, +) © (Q,-) as a semigroup. 
(iv) Aut(Q,+) ~ (Q*,-). 
(v) End(Zm,®) © (Zm,*) as a semigroup. 


(vi) Aut(Zm,®) * (Um, *)- 


Proof (i) Define a map 7 : End(Z,+) — Zby 7n(f) = fd). It follows 
from Example 4.3.9 that 7 is bijective and n(gof) = gof(l) = g(fU)) = 
gA)- fd) = n(g)- Cf). Thus, 7 is an isomorphism. 

(ii) Let f € Aut(Z, +). Then, again by Example 4.3.9, f(a) = a- f(1). Since 
f is bijective, f(1) = +1. Thus, 7 defined in (1) when restricted to Aut(Z, +) 
defines an isomorphism from Aut(Z, +) to the group ({1, —1}, -). 

The proof of (iii) is similar to that of (i), and the proof of (iv) is similar to that 
of (ii) provided we note that multiplication by a rational number r from Q to Q is 
bijective if and only if r 4 0. 

(v) Define a map 7 from End(Zm,®) to Zm by n(f) = f(1). Suppose that 
n(f) = n(g). Then, f(1) = g(1). But, then 


fF) = fiIele- Oo) = fHNEFOO~ OG f0)) = 
gQ) 6g) @--- Og) = gL G10--- G1) = gf) 


r r 


Thus, f = g. This shows that 7) is injective. Next, letr € Z,,. Define a map f from 
Zm to Zm by f(a) = rxa. Then, f € End(Z,,, ®) (verify) and7(f) = 7. Hence, 
7 is also surjective. Further, 


n(gof) = gof() = g(f()) = gQ)* f(D) = ng) «n(f) 
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This proves that 7 is an isomorphism. 

(vi) We first show that an endomorphism f € End(Z,,, @) defined by f(a) = 
r xa is an automorphism if and only if 7 € U,,. Suppose that r € U,,. Then, there 
existsas € U» such that7*s = 1.Defineamap g from Z,, to itself by g(@) = 5a. 
Then, 


gof@) = g(f@) = gFxa) = S*F xa = 1*4 =4 


Thus, gof is the identity map. Similarly, it follows that fog is also the identity map. 
Hence, f is an automorphism. It follows that the map 7 defined in (v) induces an 
isomorphism from Aut(Z», ®) to (Un, *). tt 


3.6. Let (G,o) be a group and a ¢€ G. Define a map f, from G to G by 
fax) = aoxoa~'. Then, fa is an automorphism of (G, 0) (verify) called an inner 
automorphism of (G, 0) determined by a. It is easy to check that faop = faof for 
all a, b € G. This shows that the map f from the group G to the group Aut(G, o) 
defined by f(a) = f, is a homomorphism. 

3.7. Let (G1, 01) and (G2, 02) be groups. Let Hom(G,, G2) denote the set of all 
homomorphisms from G, to G2. Let f and g be members of Hom(G,, G2). Define 
amap f x g from G, to G2 by 


(fxg)\(x) = f(xjorg(x), x € Gy. 


It is easy to check that f * g € Hom(G,, G2) provided that each element of f (G1) 
commutes with each element of g(G,). Assume that (G2, 02) is an abelian group. 
Then, * defines a binary operation on Hom(G, G2), and Hom(G,, G2) is an abelian 
group with respect to x. The constant map e from G, to Go defined by e(x) = e2 is 
the identity of the group. The inverse f~' of f is given by f~'(x) = f(x)7!. The 
commutativity of x follows from that of 02. We have the following: 


(i) For all abelian group (G, 0), Hom(Z, G) ¥ G. 
(ii) Hom(Zm, Zn) ~ Za, where d is the greatest common divisor of m and n. 


Proof (i) The map 7 from Hom(Z, G) to G defined by n(f) = f(1) is easily 
seen to be an isomorphism. tt 

(11) We define amap 7 from Hom(Z,,, Z,) to Zy as follows: Let f € Hom(Z», Zn). 
Suppose that f(1) = 7 € Z,. Then, 


t= f0) = fn) = ar 


This implies that n dividesm-r.Letn = duandm = dv. Then, (u,v) = 1 


and du divides dur. Hence, u divides ur, Since (u,v) = 1, u divides r. Thus, 
- € Z. Now, define 7(f) = ca — a in Z,. If r = s in Z,, then n divides 
r — s, and so d divides a _ a Thus, 7 is a map from Hom(Z,,, Z,) to Zg. 


Next, let f,g € Hom(Z,,, Zn). Suppose that f(1) = 7 and g) = S. Then, 
(f+g9)0) = 7F@s =7r-+s. Hence, 
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n(if+g) = 


d d d 
oe ee aaa) 
n n n 


This shows that 7) is a homomorphism. Next, suppose that 


dr ds 
nf) = — = 7g = —, 
n n 
where f(1) = 7 and g(1) 5. Then, d divides “ — “. But, then — is an 


integer. This means thatr = sin Z,. In turn, 
f@ = tr = ts = gif) 


for allt € Z,,. This shows that f = g, and so77is injective. Finally, let@ € Zz. Define 
arelation f from Z,, to Z, by f(f) = oe It is easily seen that a isa map which 
is also a homomorphism. Further, since 7 and d are co-prime, f(1) = “4 = a. 
Thus, 77 is also surjective. ft 


Exercises 


4.3.1 Let (G, 0) be a group. Show that the map a ~ a7! from G to itself is a 
homomorphism if and only if the group is abelian. 


4.3.2. Show that the map a ~~ a” from a group to itself is a homomorphism if and 
only if the group is abelian. 


4.3.3, Show that the conjugation map z ~» Z from (C, +) to itself is an automor- 
phism. 


4.3.4 Show that (Us, *) ~ (Z4, ®). 
4.3.5 Show that (R*, -) is not isomorphic to the group (R, +). 


4.3.6 Find all homomorphisms from the Quaternion group Qs to the Klein’s four 
group V4 and also all homomorphisms from V4 to Qg. Do we have a monomorphism 
from V4 to Qg? 


4.3.7 Find out all homomorphisms from (Z, +) to V4. 


4.3.8 Find the number of homomorphisms from (Z\6, ®) to (Z12, @). What is the 
group Hom(Z6, Z12)? 


4.3.9 Let (G, 0) be a group and Hom(Z, G) denote the set of all homomorphisms 
from the additive group of integers to the group (G, o). Show that the map 7) from 
Hom(Z, G) to G defined by n(f) = f (1) is a bijective map. 


4.3.10 Let (G, 0) be a group and X = {a€ G|a” = e}. Define a map 77 from 
Hom(Z,,, G) to G by n(f) = f (1). Show that 7) is injective map and its image is 
X. 
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4.3.11 Show, by means of an example, that if (G2, 02) in illustration 3.7 is not 
abelian, then f x g need not belong to Hom(G,, G2) even if f and g are homomor- 
phisms. 


4.3.12 Show, by means of an example, that even if the group (G2, 02) in the illus- 
tration 3.7 is nonabelian, the product f * g of a pair of homomorphisms f and g may 
be a homomorphism. 


4.3.13 A pair f, g of homomorphisms from a group (G1, 01) to a group (G2, 02) 
is said to be summable if f * g is also a homomorphism. Show that a pair f, g of 
homomorphisms from a group (G_, 0;) to a group (G2, o2) is summable if and only 
if each element of f(G,) commutes with each element of f (G2). 


4.3.14 Show that any two endomorphisms of Qg which are not automorphisms are 
summable. Indeed, image of any endomorphism which is not an automorphism is 
trivial or else {1, —1}. 


4.3.15 Characterize groups in which all pairs of endomorphisms are summable. 


4.3.16 Show that there is no nontrivial homomorphism from the circle group S! (or 
the multiplicative group P of roots of unity) to the additive group Z of integers. 


4.3.17 Show that every continuous homomorphism from the additive group R of 
real numbers to itself is multiplication by a real number. Deduce that the group 
Hom,(R, R) of all continuous homomorphisms from R to R is isomorphic to the 
additive group R of real numbers. 


4.3.18 Show that the set of all continuous automorphisms of the additive group R 
of real numbers is a group with composition of maps which is isomorphic to the 
multiplicative group R* of nonzero real numbers. 


4.3.19 Find all continuous homomorphisms from the additive group R of real num- 
bers to the multiplicative group R* of positive real numbers. Show that the group 
Hom,(R, Rt) (with pointwise multiplication) of all continuous homomorphisms 
from the additive group R of real numbers to the multiplicative group R* of positive 
real numbers is isomorphic to the multiplicative group of positive real numbers. 


4.3.20 Show that there is no homomorphism f from (Z, +) to (Q*,-) such that 
fO6) = 2°. 


4.3.21 Find all continuous homomorphisms from S! to itself. 


4.3.22 Show that the group Hom(Qg, V4) is isomorphic to the external direct prod- 
uct V4 x V4. 


4.3.23 Find out all members of AutQg and also AutV4. Are they isomorphic? 
Support. 
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4.3.24 Show that G; ~ G2 implies that AutG,; ~ AutG 2. Show further by means 
of an example that AutG; ~*~ AutGz does not imply that G,; ~*~ Go. 


4.3.25 Let f beahomomorphism from (Q‘, -) to (Q, +). Suppose that f(3) = —1. 
Find f (27). 


4.3.26 Show that (C*, -) and (R%*, -) are not isomorphic. 


4.3.27 Let X denote a square in R?. Show that Iso (R’) is not isomorphic to Qs, 
whereas they are nonabelian groups of same order. 


4.3.28 Let (G, o) be an abelian group and c € G. Consider the group (G, *), where 
axb = aoboc™'. Define amap f from G to G by f(a) = aoc. Show that f is 
an isomorphism. 


4.3.29 Show that Z,4 is not isomorphic to V4. Show also that Hom(Za4, V4) © V4. 


4.3.30 Show that ahomomorphism f from a group (G1, 01) to (G2, 02) is injective 
if and only if f~'({e2}) = {ei}. 

4.3.31 Let (G, 0) be a group such that a ~» a", a ~ a"*!, anda ~» a"*? 
homomorphisms, n > |. Show that (G, 0) is abelian. 


are 


4.3.32 Let (A, 0;) and (Az, 02) be abelian groups and f a homomorphism from 
A, to Ao. Let (G, 0) be a group. Define a map f, from Hom(G, A;) to Hom(G, Az) 
by f*(g) = fog. Show that f, is a homomorphism which is injective provided 
that f is injective. Suppose further that (G, 0) is also abelian. Define a map f* from 
Hom(A2, G) to Hom(A,, G) by f*(g) = gof. Show that f* is a homomorphism 
which is surjective provided that f is surjective. 


4.3.33 Show that (R, +) and (S', -) are not isomorphic. 


4.3.34 Let G be a finite group and f an automorphism of G such that f(x) = 
x implies that x = e. Show that the map g from G to G defined by g(x) = 
x! f(x) is bijective. Suppose that fof = Ig. Show that G is abelian and f(x) = 
x! forall x € G. 


4.3.35 Let (G, 0) be a group containing more than two elements. Show that there 
is a nonidentity automorphism of (G, 0). 

Hint. If there is an element a € G which does not commute with all elements of G, 
then the inner automorphism determined by a is nonidentity automorphism. If G is 
abelian and a~! #4 a for some a in G, then a ~» a~! is anonidentity automorphism. 
Ifa = a™' for alla € G, then take any two distinct nonidentity elements a, b € G 
and define a map f from G to G by f(a) = b, f(b) = a,and f(x) = x forall 
x different from a, b. Check that f is a nonidentity automorphism. 


4.3.36 Determine the groups Hom((Q, +), (R, +)), Hom((Q, +), (C, +)), 
Hom,((R, +), (R, +)), Hom,((R, +), (s', ‘)), and Hom.((S', ), (St, -)). 
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4.3.37 Let f be anontrivial homomorphism from (Q, +) to (R, +). Show that f(Q) 
is dense in R. 


4.3.38 Let f be a homomorphism from (R, +) to (R, +) ((S!, -)) which is contin- 
uous at a point. Show that f is continuous (indeed differentiable) at all points. 


4.3.39 Let f be an automorphism of (IR, +) which is also semigroup homomor- 
phism from the semigroup (R, -) to itself. Show that f is the identity map. 


4.4 Generation of Groups 


To study a mathematical structure, it is always good to have sufficiently many exam- 
ples of that structure. This is because examples give deep insight to the structure. 
Thus, one tries to develop different construction processes by which one can con- 
struct different examples from known examples. The first step in this direction would 
be to see as to how to construct groups out of subsets of a group G. 

Let (G, 0) be a group and H a nonempty subset of G. To make H a group, we 
need a binary operation on H. For that purpose, we may look at the binary operation 
o of G. If aob € H forall a,b € H, then the binary operation o of G induces a 
binary operation o’ on H defined by ao'b = aob for all a,b € H. Even now, 
H together with the induced binary operation o’ need not be a group. For example, 
(Z, +) is a group, where + induces binary operation on N but N together with the 
induced addition is not a group. This motivates to have the following definition. 


Definition 4.4.1 Let (G, 0) be a group. A nonempty subset H of G is called a 
subgroup of (G, o) if the binary operation o of G induces a binary operation on H 
with respect to which H is a group. 


Thus, if H is a subgroup, then aob € H foralla,be€ H. 


Proposition 4.4.2 Let (G, 0) bea group and H a subgroup of G. Denote the induced 
binary operation on H by o’. Let e be the identity of the group (G, 0) and e' the identity 
of the group (H, o'). Leta € H. Let a~' denote the inverse of a considered as an 
element of the group (G, 0) and ag the inverse of a considered as an element of the 
group (H, o'). Then, 


(i)e=e 
and 

(ii) a = a; 

Proof Since o’ is the induced operation on H,aob = ao‘bforalla,b € H. Further, 

since e is the identity of G and é’ is the identity of H, we have 


to of tak / / 
eoe =eoe =e = e@0e. 


By cancellation law in (G, 0),e = e’. Further, 
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= at pen, 7s, es $n ST 
dy, 0d = a,0a =e@ =e = a oa. 


By cancelation law, aj! = a7}. tt 
The following corollary is immediate. 


Corollary 4.4.3. Let (G, 0) be a group. Then a subset H of G is a subgroup if and 
only if 
(i) aobe HVa,be H. 
(ii) ee H. 
(iii) a~' € H forallaeé H. t 


Proposition 4.4.4 Let (G, 0) be a group. A nonempty subset H of G is a subgroup 
if and only ifaob-' € H(a“'ob € H) foralla,b € H. 


Proof If H is a subgroup and a,b € H, then from the above corollary a,b~! € 
H, and so aob-! € H(a~'ob € H). Conversely, suppose that H # @ and 
aob7! for all a,b € H. Since H # Q, there is an element a € H. Hence, 
e = aoa”! € H.Further, sincee € H,ifa €¢ H,thena~! = eoa™! € H. Finally, 
if a,b € H, then as already proved, b~' € H. Hence, aob = ao(b~')~! € H. 


From the above corollary, it follows that H is a subgroup of G. tt 


The proof of the following proposition is by the induction and is left as an exercise. 


Proposition 4.4.5 Let H be a subgroup of (G, 0). Then, 


(i) a” €H forallaé€ HandneZ. 
(ii) a;'oay’o---oa™ € H whenever ay, a2,...,a, € H andny,no,...,n, € Z. t 


Proposition 4.4.6 Let (G, 0) be a group. Then a finite nonempty subset H of G is 
a subgroup if and only ifaob € H forall a,b € H. 


Proof The condition is necessary because of Corollary 4.4.2 (i) Conversely, suppose 
that H is anonempty finite subset of G such that aob € H foralla,b € H.Then, the 
binary operation o induces a binary operation o’ on H. Clearly, (H, o’) is a nonempty 
finite semigroup in which cancelation law holds. Hence, (H, o’) is a group at its own 
right. This means that H is a subgroup of G. tt 


Example 4.4.7 {e}, where e is the identity of the group (G, 0), is a subgroup of 
(G, o) called the trivial subgroup. G is also a subgroup of G called the improper 
subgroup of (G, o). Other subgroups of G are called proper subgroups of (G, 0). 


Example 4.4.8 Z is a subgroup of (Q, +), Q is a subgroup of (R, +), and R is a 
subgroup of (C, +). 


Example 4.4.9 S' is a subgroup of (C*,-), and the set P of roots of unity is a 
subgroup of (S!, -). 
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Example 4.4.10 {e, a}, {e, b}, {e, c} are all proper subgroups of V4 (verify). Thus, 
there are five subgroups of V4: one trivial, one improper, and three proper subgroups 
of V4. 


Example 4.4.11 Nontrivial proper subgroups of the Quaternion group Qxs can be 
enumerated as {1, —1}, {1,7, —1, —7}, {1, 7, -1, —j}, and {1,k, —1, —k}. Thus, 
there are 6 subgroups of Qs. 


Example 4.4.12 In this example, we find all subgroups of (Z, +). Let m € NL {0}. 
Then, mZ = {mr | r € Z} is a subgroup of Z (verify). They are all distinct, for 
ifmZ = nZ,thenm = nr andn = ms for somer,s € N(){O}. But then 
m = msr.Clearly,m = Oifand only ifn = 0.Ifm #0, then, rs = 1. Since 
r,sEN,r = 1 = s. Thus,mZ = nZif and only ifm = n. Lastly, we show 
that every subgroup of (Z, +) is of the form mZ for some m € N {0}. Let H bea 
subgroup of (Z, +). If H = {0}, then H = 0-Z. Suppose that H 4 {0}. Then, 
there exists r € H — {0}. Since H is a subgroup of (Z, +), the inverse —r of r 
belongs to H. Hence, there is an element r ¢ N(]) H, andsoN()H # &. By the 
well-ordering principle in N, N( H has the least element m (say). We show that 
H = mZ,.Sincem € H and H isasubgroup of (Z, +),mr € H forallr € Z. Thus, 
mZ C H.Leth € H. Since m £ 0, by the division algorithm, there existq,r € Z 
such thath = mq + r,wherer = OorelseO < r < m.Sinceh € H and also 
mq € H,and H is asubgroup,r = h—mq € H.Hence,r = 0, for otherwise 
r €N()H andr < m,acontradiction to the choice of m. Thus,h = mq € mZ. 
To summarize, the set S(Z, +) of all subgroups of (Z, +) is {mZ | m € N L{0}} 
and there is a bijection from the set S(Z, +) to N L){0} given by mZ ~» m. 


Example 4.4.13 In this example, we find all subgroups of (Z,,, ®), where m > 0. 
{0} is the trivial subgroup. Let H + {0} be a subgroup of (Z,,,@). Then, 
there exists r, 0 < r < msuchthatr e€ H. Thus, the set {r ¢ N | r € A} is 
nonempty set. By well-ordering property in N, this set has the least element ¢ (say). 
We first show that ¢ divides m. By the division algorithm, there exist q,r € Zsuch 
thatm = tq + r,wherer = OorelseO < r < ¢.But,then0 = m = qt +7. 
Thus,7 = —qt € H.Hence,r = 0, for otherwise 7 € H andO < r < t. Thus, 
t divides m. Let m = qt. Then, we show that 


H = {0,#,2¢,...,(@— Dé}. 
Since f € H and H is a subgroup, 


(0, 7,27,...,.@—Di} CH 
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Leta €¢ H, 0< a <m-—1. By the division algorithm there exist 1,r € Z such 
that 


a=lt+yr, 
where r = Oorelse0 < r < t. Since @ and f belong to H and H is a subgroup, 
r = a — It € H.Hence,r = 0, for otherwise 7 € HandO < r < ft,a 
contradiction to the choice of t. Hence,@ = Jt = lt, where! € {0,1,2,...,q—1}. 


This shows that 
H = (0,7,2f,...,(¢@ — Df}. 


Thus, every subgroup H of (Z,,, ®) is determined by a unique divisor t of m in 
the sense described above (note that {0} is determined by m). Next, we observe 
that distinct divisors determine distinct subgroups, for the subgroups determined by 
distinct divisors have distinct orders (the order of the subgroup determined by the 
divisor t isq, wherem = qt). Tosummarize, every subgroup of (Z,,, ®) determines 
and is uniquely determined by a positive divisor ¢ in the sense described above. In 
particular, there are as many subgroups of (Z,,, ®) as many divisors of m. 


Remark 4.4.14. We have a function t : N —~> N (called the divisor function) 
defined by 7(n) = the number of divisors of n. Clearly, the number 7 (7) of divisors 
ofn = p{'p;’... pe’, where pj, po,..., p, are distinct primes, is given by 


T(n) = (ay + 1)(aa +1)... (a, +1). 


Thus, 7(7) is the number of subgroups of (Z,,, ®). 


Definition 4.4.15 Let (G, 0) be a group. Then, Z(G) = {a € G | aox = 
xoa for all x € G} is a subgroup of (G, 0) (verify) called the center of (G, 0). 
Let a € G. Then, Cg(a) = {x € G | aox = xoa} is a subgroup of (G, 0) 
(verify) called the centralizer of a in (G, 0). Again, given a subset S of G, consider 
Ne(S) = {x €G | xoS = Sox}, where xoS = {xoy | y € G}. Then, Ncg(S) 
is a subgroup of (G, 0) (verify) called the normalizer of S in (G, 0). 


The center Z(Qg) of the Qg is the subgroup {1, —1}. The centralizer Cg, (i) of 
i in Qg is the subgroup {1, i, —1, —i} of Qg. The normalizer of {1, 7, —1, —i} in Qg 
is Qg itself. 


Example 4.4.16 In this example, we describe all subgroups of (IR, +). Let H be a 
nontrivial subgroup of (R, +). Ifa ¢ H, then —a € H. Thus, H (|) R* 4 @. There 
are two cases: 

Case (i) H () R* has the least element h (say). 

Case (ii) H (| Rt has no least element (i.e., 0 is the limit point of H). 
Consider the case (i). We show that H = {nh |n € Z}. Sinceh € H, {nh |ne 
Z} C H.Leta € Handa > 0.Sinceh > 0, by the Archimedean property of R, 
there exists n € N such that ¢ < n. Thus, the set {n eN| ¢ < n} #9. By the 


well-ordering property of N, it has least element m (say). Then, 0 < m—1 < ¢ 


h < mM. 
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This means that (m — 1)h < a < mh. We show thata = (m — 1)h. Suppose 
not. Then (m — 1)h < a < mh. Sincea € H and (m — 1)h € H, it follows that 
a—(m—1)h € H. Clearly,0 < a—(m—Wh < mh—(m—I1)h = h. This 
is a contradiction to the supposition that / is the least element of H (| R*. Thus, 
a e€{nh|ne€ Z}.Ifa < 0, then from what we have proved above, it follows that 
—a = nhforsomen € Z,andsoa = (—n)h. This shows that H = {nh | n € Z}. 

Next, consider the case(ii). In this case, we show that between any two distinct real 
numbers there is a member of H (such sets are called dense sets). Ifa < 0 < b, 
then there is nothing to do (0 € H). Suppose thatO < a < b. Takee = bra 
Since H (] Rt has no least element, there ish € H, h A Oandh < ec. By the 
Archimedean property of R and well-ordering property of N, we have the smallest 
n € Nsuchthata < nh. Suppose thatnh > b. Then, (n—1)h > a, acontradiction 
to the choice of n. Thus,a < nh <_ b. Since H is a subgroup, nh € H. Next, 
suppose thata < b < 0. Then, 0 < —b < —a. Hence, from the previous case, 
there ik € H suchthat —b < k < —a. But, thena < —k < b. Since H 
is a subgroup, —k € H. To summarize, a subgroup H of (IR, +) either consists of 
integral multiples of a fixed real number, or it is dense in the sense that between any 
two real numbers there is a member of H. In particular, a proper closed subgroup of 
(R, +) consists of all integral multiples of a fixed real number. 


Operations on Subgroups 
Proposition 4.4.17 Intersection of a family of subgroups is a subgroup. 


Proof Let {H, | a € I} be a family of subgroups of a group (G, 0). Since Hy is 
a subgroup for all a € I, e € H, forall a € I. Hence, e € (\,-; Ho. Thus, 
Meer Ha 4G. Let a,b € (),-; Ho. Then, a,b € Hy for all a € I. Since each Hy 
is a subgroup, aob~! € H, forall a € I. Hence, aob™! € (),,<; Ha. This shows 
that (|, Ha is a subgroup. tt 


ael 


Remark 4.4.18 Union of subgroups need not be a subgroup. For example, {e, a} 
and {e, b} are subgroups of the Klein’s four group, but their union {e, a, b} is not a 
subgroup(it does not containab = c). 


Proposition 4.4.19 Let H, and H be subgroups of a group (G, 0). Then H, J Hz 
is a subgroup if and only if H, © Hy or Ay © Ay. 


Proof If H; © Ho, then H; ) Hp = Hp isasubgroup. If Hy C A), Hi VU Hy = Mi 
is a subgroup. Conversely, suppose that H; J Hp is a subgroup and H, ¢ Hp. Let 
h € Hy — Hy. Let k be any element of Hy. Then, h,k € Hy J Ao. Since H; Ap is 
a subgroup, hok € H, ) Hy. Now, hok ¢ Hp, for otherwise h = (hok)ok—! would 
become a member of H>, a contradiction to the choice of h. Since h € Hy and Hj, is 
a subgroup, k = h-'ohok belongs to H;. This shows that Hy C Hj. tt 


Definition 4.4.20 A family {H, | a € J} of subgroups of a group (G, 0) is called a 
chain of subgroups if given a, 3 € J, Ha, C Hg or Hg © Hg. 


Proposition 4.4.21 Union of a chain of subgroups is a subgroup. 
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Proof Let {H, | a € I} be a chain of subgroups of a group (G, 0). Clearly, e € 
User Ha. Let a,b € Une; Ha. Then, a € H, and b € Hz for some a, ( € 1. Since 
the given family is a chain, H, © Hg or Hg C Hy. Thus, a,b € H, or a,b € Hg. 
Since H,, and Hg are subgroups, aob"! belongs to H, or to Hg. This shows that 
ab~' belongs to the union L),-; Ha. tf 


Definition 4.4.22 Let (G, 0) be a group and A, B be subsets of G. Then, the set 
AoB = {aob|aeéAand be B} is called the product of A and B. 


Thus, the product of subsets of G defines a binary operation on the power set 
§(G) of G. Is it a group? Is it a semigroup? 


Proposition 4.4.23 Let H and K be subgroups of a group (G, 0). Then, HoK is a 
subgroup if and only if HoK = KoH. 


Proof Suppose that HoK = _ KoH. Clearly,e =  eoe belongs to Hok. 
Thus, HoK # @. Let aob,cod € HoK, where a,c € H and b,d € K. Then, 
(aob)o(cod)~! = aobod-'oc~! = aokoc™'!, wherek = bod=' € K.Now, 
koc-! € KoH.Since KoH = HoK,koc~! € HoK. Suppose that koc~! = uov, 
where u € H andv € K. Then, (aob)o(cod)~'! = aouov € HoK. This shows that 
HoK is a subgroup. 

Conversely, suppose that HoK is a subgroup. Let hok € HoK, where h € H 
and k € K. Then, k~!oh=! = (hok)~! € HoK. Suppose that k~'!oh~! = uov, 
where u € H and v € K. Then, hok = v~'ou~! € KoH. This shows that 
HokK C Kol. Further, let koh € KoH, where k € K andh € H.Thenkoh = 
(h-'ok~!)"! © HoK, for h~'ok~! € HoK and HoK isa subgroup. This shows 
that KoH C HokK. It follows that HoK = KoH. tt 


Remark 4.4.24 In general, HoK need not be a subgroup. Consider the symmetric 
group S3 of degree 3 of all bijective maps from {1, 2, 3} to itself (see Example 
4.1.24). Let H = {J, f}, where J is the identity map on {1, 2, 3} and f is the map 
given by f(1) = 2, f(2) = l1,and f(3) = 3.Clearly, fof = J and hence H 
is a subgroup of $3. Let K = {J,g}, where g is given by g(1) = 1, g(2) = 3, 
and g(3) = 2. Then as above, K is also a subgroup. Now, HoK = {I, f,g, fog} 
is not a subgroup for gof ¢ HoKk (verify). 


Homomorphisms and Subgroups 


Proposition 4.4.25 Let f be a homomorphism from a group (G1, 01) to (Ga, 02). 
Let H, be a subgroup of (G,, 01) and Hz a subgroup of (G2, 02). Then f (HM) is a 
subgroup of (Gi, 01) and f~'(H) is a subgroup of (G4, 01). 


Proof Since e; € Hy (HM, is a subgroup), eg = f(e;) € f(A). Thus, 
f(A) 4G. Let f(a), f(b) € f(A), wherea, b € H,.Since f isahomomorphism, 
f (@orf (b)~! = f(ao,b™'). Further, since H; is a subgroup, ao,b~! € H, and so 
f(@orf(b)~! = f(ao,b~') € f(A). This shows that f(H}) is a subgroup. 
Again, since f(e;) = 2 (for, f is a homomorphism) and e2 € HA» (for, Az 
is a subgroup), e, € f~'(H>). Thus, f-'(Hb) A O. Let a,b € f~!(Hb). Then, 


128 4 Group Theory 


f(a, f(b) € Hy. Since Hy is a subgroup, f(ao,b“!) = f(@orf(b)"! € Hb. In 
turn, ao,;b7! © f(b); This shows that i (H2) is a subgroup. tt 


Corollary 4.4.26 Let f be a homomorphism from a group (G\, 01) to a group 
(G2, 02). Then, f—'({e2}) is a subgroup of (Gj, 04). 


Proof Since {e2} is a subgroup of (Go, 02), the result follows from the above propo- 
sition. tt 


Definition 4.4.27 Let f be a homomorphism from a group (Gj, 0;) to a group 
(Go, 02). Then, f~'({e2}) = {a € G; | f(a) = e2} is called the Kernel of f and 
is denoted by ker f. 


Thus, ker f is a subgroup of (G1, 01). If f is a homomorphism, then f(a) = 
f(b) if and only if a~'ob € ker f. Evidently, f is injective if and only if 
ker f = {ey}. 

Proposition 4.4.28 Let H be a subgroup of a group (G1, 0;) which contains the 
kernel of a homomorphism f from (G1, 01) to a group (G2, 02). Then 


f(D) = H. 
Proof Clearly, H C f~'(f(H)). Leta € f~'(f(A)). Then, f(a) € f(H), and 
so f(a) = f(h) for some h € H. But, then aojh-'! € ker f C H. Thus, 
a = ao,h~'o,h belongs to H. This shows that f~'(f(H)) C H. t 


Theorem 4.4.29 (Correspondence Theorem) Let f be a surjective homomorphism 
froma group (Gj, 01) to a group (G2, 02). Let S(G) denote the set of all subgroups 
of (G,, 0,) which contain kernel of f and S(G2) denote the set of all subgroups of 
(Gy, 02). Then f induces bijective map ¢ from S(G) to S(G2) defined by ¢(H) = 
FS (A) (the image of H under f ). 


Proof Suppose that 6(H,) = (Az). Then, f(H,) = f(A). From the above 
proposition, H;} = f—'(f(M)) = f-'U Ub)) = Hy). Thus, ¢ is injective. Next, 
let K € S(G2). Then, f~'(K) is a subgroup containing f~'({e2}) = ker f. Hence, 
f-'(K) € S(G)). Since f is surjective, 6(f~'(K)) = f(f~'(K)) = K. Thus, 
o is surjective also. tt 


Subgroup Generated by a Subset S 


We have observed that a subset S of a group G need not be a subgroup of G. A 
natural question is: “How far is the subset S from being a subgroup?’ In other words, 
how can we make S a subgroup with minimum effort? Taking out elements from S 
will not help in general (for if e ¢ S, taking out elements from S will never make it a 
subgroup). We can always make S' a subgroup by putting some members of G in to $ 
(if worst comes, by putting all members of G — S in S). Again, as desired, we should 
add minimum number of elements of G to make it a subgroup. In other words, we 
are interested in the smallest subgroup of (G, 0) containing S. The existence of such 
a subgroup follows from the following theorem. 
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Theorem 4.4.30 Let (G,o) be a group and S a subset of G. Then the smallest 
subgroup of G containing S exists and is unique. 


Proof The family of all subgroups containing S is nonempty. The intersection of this 
family of subgroups is the smallest subgroup of (G, 0) containing S. tt 


Definition 4.4.31 Let S be a subset of a group G. Then, the smallest subgroup of G 
containing S is called the subgroup generated by S. This subgroup is denoted by 
<S>. 


Remark 4.4.32 A subset H of a group (G, 0) is the subgroup generated by S if and 
only if 


(i) A is a subgroup. 
gi) SCH. 
(iii) If K is a subgroup of (G, 0) containing S, then H C K 


Proposition 4.4.33 Let (G, 0) be a group and S a nonempty subset of G. Then the 
subgroup generated by S is the set of all finite products of integral powers of elements 
of S. More explicitly, 


<S>= {aj'oayo---oa | a; € Sand n; € Z} 


Proof Let X denote the set of all finite products of integral powers of S. Since every 
element of S is an integral power of the same element, S C X. Next, if H isa 
subgroup of G containing S, then by Proposition 4.4.5, aj'oa3’o0...oa” belong to 
AT for all a), do, ..., a, in S and integers n;,2,...,”,. Further, since products of 
integral powers of elements of S are again a product of integral powers of elements 
of S, and also the inverse of a product of integral powers of elements of S' is again a 
product of integral powers of elements of S, it follows that X is a subgroup. Thus, 


X is the smallest subgroup of G containing S,andso< S>= X. tt 


Following is an immediate corollary of the above proposition. 


Corollary 4.4.34 Let (G,o) be a group and a € G. Then, < {a} >= {a” | 
neé Z}. t 


The subgroup generated by a singleton {a} is conveniently denoted by < a > 
instead of < {a} >. 


Example 4.4.35 <@>= {e}, for {e} is the smallest subgroup of G which contains 
the @. 


Example 4.4.36 The subgroup generated by {a} in V4 is {e, a}, for {e, a} is the 
smallest subgroup which contains {a}. The subgroup generated by {a, b} is the group 
V4 itself, for any subgroup of V4 containing a, b will containc = aob and e also. 
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Example 4.4.37 The subgroup of the Quaternion group Qg generated by i is {1, —1, 
i, —i}, for : is a subgroup and any subgroup which contains i contains 1,—-1 = 
i?,—i = i andi also. The subgroup < {i, j} > is Qs itself, for if a subgroup of oe 
contains {i, 7}, then it also contains 1, —1,i, —i, 7, —j,k = i-jand—k = 


Example 4.4.38 The subgroup of (Z,-+) generated by {m} is mZ (the set of all 
integral multiples of m), for mZ is the smallest subgroup of (Z, +) containing {m}. 
In particular, Z =<1>=<-l1>. 


Example 4.4.39 In this example, we determine subgroup generated by an arbitrary 
subset S of the additive group Z of integers. If S = @, then < S > = {0}. Since 
< S>=< S-— {0} >, we may assume that 0 ¢ S. Let d be the positive greatest 
common divisor of elements of S (g.c.d of an infinite set can be defined in the same 
manner as it is done for finite set of integers). Then, we show that < S$ >= dZ. 
Clearly, S C dZ as all members of S are multiples of d. Also, dZ is a subgroup of 
(Z, +). Next, let H be a subgroup of (Z, +) containing S. We can find a finite subset 


{m,,m,...,m,} of S such that g.c.d of S is the same as that of m,,m2,..., m,. By 
the Euclidean algorithm, there exist integers u,, U2, ..., u, such that 
d = uym, + uom2 + +++ +u;-m, 


Let dt € dZ,t € Z. Then, 
dt = tuym, + tugmz + ---+tu,m, 


is amember of H, for H is a subgroup and m1, m2,...,m, € H 


Example 4.4.40 In this example, we determine the subgroup generated by an arbi- 
trary subset of the group (Z,,, ®). Let S be a subset of Z,,. If S = Bor S = {0}, 
then < S > = {0}. Let 


S = {r1,72,-..,Tt}; 
where we can assume that 7; 4 0 for alli = 1,2,...,f. This means that m does 
not divide r; for all i. Let d be g.c.d of m,7r1, r2,...,7,. We show that 


<S>= {0, d,2d,...,(¢—ldt, 


where m = qd. Since d divides r; for all i, it is clear that 


{71,12,---,Tr} Cc (0, d,2d,...,@—- 1)d} 
Further, it is easy to observe that {0, 2d,..., (q- 1)d} isa subgroup of Z,,. Next, let 
H beasubgroup of (Z,,, ®) containing {71, 72, ..., 7;}. By the Euclidean algorithm, 


there exist integers u, U1, U2,..., U, Such that 


d= um + uwyyry +--+ Yr; 
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But, then 


d = uyry + uorg + +++ + yr; 


belongs to H, and since H is a subgroup, {0, d, 2d,..., (¢—1)d} © H. This shows 
that ae = 
<S>= {0,d,2d,...,(q —1)d}. 


Example 4.4.41 As aparticular case of the above example, we find that the subgroup 
of Zm generated by {7} is {0,d, 2d,..., (q — 1)d}, where d is g.c.d of m and r and 
qd = m. In particular, singleton {7} generates Z,, if and only if m and r are co- 
prime. Thus, there are as many singleton generators of Z,, as many positive integers 
less than m and co-prime to m. It follows that there are }(m) singleton generators of 
Zn: 


Definition 4.4.42 Let (G, 0) be a group. A subset S of G is said to be a set of 
generators of G if < S > = G.A group (G, 0) is said to be finitely generated if 
it has a finite set of generators. 


Example 4.4.43 {1} and {—1} are the only singleton generators of (Z, +) (prove it). 
It follows from examples above that the set {71, m2,...,m,} is a set of generators 
of Z if and only if the positive g.c.d of the above set is 1. It also follows that the set 
{r1, 72, .-.-,7;} 18 a set of generators of the group (Z,,, ®) if and only if the positive 
g.c.d of m,r},ro,...,7; is 1. There are @(m) singleton generators of (Z,,, ®). 


Example 4.4.44. The group (Q, +) is not finitely generated: Let S be a finite subset 
of Q. Let P be the set of primes appearing in the denominators of the members of S. 
Then, P is finite and so there are primes outside P. It is clear that < S$ > contains 
no rational number in reduced form whose denominator is a prime not in P. Hence, 


<S>¢Q. 


Proposition 4.4.45 Let f be a surjective homomorphism from a group (Gj, 01) to 
a group (G2, 02). Let S be a set of generators of (G,, 01). Then, f(S) is a set of 
generators of (G2, 02). 


Proof Let K be a subgroup of G2 containing f(S). Then, f~!(K) is a subgroup of 
G, containing S. Since S generates G, it follows that f~'(K) = Gj. Since f is 
surjective, K = f(f~'(K)) = f(G;) = G». This shows that f(S) generates 


G>. st 
Corollary 4.4.46 Any homomorphic image of a finitely generated group is finitely 
generated. f 
Exercises 


4.4.1 Determine all subgroups of (Zs, @). 


4.4.2 Determine all subgroups of (Uj2, x). 
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4.4.3 Let H be a subgroup of (G, 0). Show that goHog! is a subgroup of G for 
allg eG. 


4.4.4 Find the normalizers of all subgroups of the Quaternion group. Find also the 
centralizers of all elements in the symmetric group $3. 


4.4.5 Let (G, 0) be a group. Show that the set /(G) of all inner automorphisms of 
G is a subgroup of Aut(G). 


4.4.6 Find the center of the symmetric group $3 and also of the Dihedral group D4. 
4.4.7 Show that a group cannot be written as union of two proper subgroups. 
4.4.8 Find the number of subgroups of (Z299, @). 

4.4.9 Find the center of the Quaternion group Qs. 


4.4.10 Let (G,, 0,) and (G2, 02) be groups. Show that a map f from G, to G2 isa 
homomorphism if and only if f considered as a subset of the direct product G, x G2 
is a subgroup. 


4.4.11 Find the kernel of the homomorphism f from the additive group R of real 
numbers to the circle group S! defined by f(x) = e!*. 


4.4.12 Find the kernel of the homomorphism v from (Z, +) to (Z, ®) defined by 
v(a) = @. 


4.4.13 Show that a homomorphism / is injective if and only ifker f = {e} 
4.4.14 Describe all subgroups of (Rt, -) and also its closed subgroups. 

4.4.15 Describe all subgroups of (P, -) and also of (S', -). 

4.4.16 Describe all subgroups and also the closed subgroups of the circle group S!. 
4.4.17 Describe all subgroups and also all the closed subgroups of (C, +). 


4.4.18 Let f be a surjective homomorphism from (G), 0)) to (G2, 02). Suppose 
that G2 is finitely generated and also the kernel of f is finitely generated. Show that 
G; is also finitely generated. 


4.4.19 Describe all subgroups of Z x Z. Generalize it for the products of finitely 
many copies of the additive group Z of integers. 


4.4.20 (a) Use the above result to show that subgroup of a finitely generated abelian 
group is finitely generated. (observe that every finitely generated abelian group is 
homomorphic image of products of finitely many copies of Z). It may be mentioned 
that the above result is not true for nonabelian groups. In fact, every countably 
generated group is subgroup of a group which is generated by two elements. 

(b) Show that every finitely generated group is finite or at most countable. Deduce 
that the additive group R of reals and the circle group S! are not finitely generated. 
Observe that the additive group Q of rational numbers is countable, but it is not 
finitely generated. 
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4.4.21 Let S be a subset of a group (G, 0). Let 
Cco($) = {g € G| gox = xogVx € G} 


Show that Cg(S) is a subgroup of G. This subgroup is called the centralizer of S$ in 
G. Observe that Cg(S) C Ng(S). Show by means of an example that equality need 
not hold. If S = {a}, then Cg(S) is denoted by Cg(a). 


4.4.22 Calla subgroup M of a group to be a maximal subgroup of a group (G, 0) 
if (i) M ¥ G, and (ii) there is no proper subgroup of G containing M properly. Show 
that (Q, +) has no maximal subgroup. 


4.4.23 Show that every finitely generated group has a maximal subgroup. 


4.4.24 Suppose that S is a set of generators of the group (Q, +) and F a finite subset 
of S. Show that S — F is also a set of generators of Q. 


4.4.25 Show that the group P of roots of unity contains no maximal subgroup. 


4.4.26 Show that if a group (G, 0) is finitely generated, then every set of generators 
of G contains a finite set of generators. 


4.4.27 Let S and T be subsets of a group (G, o) and H asubgroup such that § C H. 
Show that (ST (| H) = S(T () H). This identity is called the Dedekind law. 


4.4.28 Let G be a finite group of order n. Show that | Aut(G) | < n!?2™, 

Hint. If G has a set X = {aj,a2,...,dm} of generators containing m elements, 
then any automorphism of G is completely determined by its effect on X and so it 
contains at most n” elements. Consider an ead m such that < X > = G. Let 


HH; =< {a 1, a,...,a;} >. Observe that ima 2. Conclude that n > 2”. 


4.4.29* Let (G, o) be finite group of order n. Let m be the number of prime divisors 
of n. Show that | Aut(G) |< n”. Give an example to show that equality may hold. 


4.4.30 Define a map f from R to S! by f(x) = e?"*. Show that f is a surjective 
homomorphism. Find the kernel of f. Also find f~'(P), where P is the subgroup 
of roots of unity. 


4.4.31 Define a map f from C to C* by f(z) = e*%, where C is the additive group 
of complex numbers and C* the multiplicative group of nonzero complex numbers. 
(i) Is f surjective? (i) Is f a homomorphism? If yes find the kernel. 


4.4.32 Show that {m+nJ/2| m,n € Z}isa subgroup of (R, +). Deduce that it is 
a dense subset of R. 


4.4.33 Show that {5; | m,n € Z}is asubgroup of (IR, +). Deduce that it is a dense 
subset of R. 
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4.4.34 Show that {e’Y2" | m € Z} is a subgroup of S!. Deduce that it is a dense 
subset of S!. 


4.4.35 Describe all finite subgroups of the circle group S!. 


4.4.36 Let G be a group all of whose proper subgroups are finitely generated. Can 
we conclude that G is also finitely generated. Support your claim. 


4.4.37 Find the number of singleton generators of Zy6. 


4.4.38 Let ¢2(n) denote the number of pair (7), r2) of integers such that | <r, < 
r2 <n and the positive g.c.d. of rj, 72, and n is 1. Find ¢2(p’), p is a prime and r 
is a positive integer. Find the number of doubleton generators of Zo. 


4.5 Cyclic Groups 


Order of an Element of a Group 


Let (G, o) be a group and a an element of G. The subgroup < {a} > generated by 
{a} is denoted by < a >. Thus, <a >= {a" |ne Z}. 


Definition 4.5.1 The subgroup < a > of (G,o) generated by {a} is called the 
cyclic subgroup generated by a. A group (G, 0) is said to be a cyclic group if it is 
generated by a single element. 


Let (G, 0) be a group and a € G. Define a map f from Z to G by f(n) = a”. 
Then, f is ahomomorphism (follows from the law of exponents), and the image of 
f is the cyclic subgroup < a >. There are two cases: 


(i) f is injective. 
(ii) f is not injective. 


In case (i), a is said to be of infinite order. Thus, a is of infinite order if and only 
ifa"” = a” implies that n = m. This means that distinct integral powers of a are 
distinct. 

Consider the case (ii) In this case, f is not injective, and so there exists a pair 
m,n of integers such thatm 4 n anda” = a”. But, thena’-”" = a'(a")"! = 
e = aa")! = a". Thus, in this case, there exists a? € N such that 
a’ = en—m€Norm—n€N). Therefore, the subset A = {1 € N| a! = e} of 
N is nonempty. By the well-ordering principle in N, A has the least element m (say). 
This least element m is called the order of a and is denoted by o(a). Thus, a natural 
number m is order of a if and only if a” = e and whenevern € Nanda” = e, 
msn. 


Proposition 4.5.2 Let (G, 0) be a group and a € G. Then a is of infinite order if 
and only if the kernel of the homomorphism f from Z to G defined by f(n) = a" 
is {O}. 
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Proof Since f is a homomorphism, it is injective if and only if the kernel of f is 


{0}. tt 


Proposition 4.5.3. Let (G, 0) be a group and a an element of finite order m. Then 
a" = eifand only ifm divides n' 


Proof Suppose that the order of a is m > 0. Then, by the definition of order of an 


element, m is the smallest positive integer such that a” = e. If m divides n, then 

n = mq forsomeg € Z. But, thena” = a! = (a”)? = e. Conversely, suppose 

that a” = e. By the division algorithm, there are integers g, 7 such that 
n=mq+yr, 


wherer = OorelseO < r < m.Now, 


e= aq” — quate = (a”)40a" = eoa’™ = a’ 
Hence, r = 0, for otherwise 0 < r < manda’ = e,acontradiction to the 
supposition that m is the order of a. Thus, m divides n. ft 


Corollary 4.5.4 Let (G, 0) be a group and a € G. Then the order of a is m if and 
only if Kernel of f is mZ, where f is the map given by f(n) = a". 


Proof ker f = {n€Z|a" = f(n) = e}. The result follows from the above 
proposition. tt 


Example 4.5.5 The order of the identity element e is 1, for 1 is the smallest positive 
integer such that e! = e. 


Example 4.5.6 Order of each nonidentity element a, b, c in V4 is 2, for e=bh= 
c? = e,anda! £e,b' £eandc! Ke. 


Example 4.5.7 Order of -1 in Qs is 2, for —1 € 1 and (—1)? = 1. Order of i is 
4,fori #1,2 = -141,% = -i 41, andi* = 1. Similarly, orders of 
j,k, -i, —j and —k are 4. 


Example 4.5.8 Order of each nonzero element in (Z, +) is infinite, for m 4 0, and 
n-m = Oimplies thatn = 0. 

Example 4.5.9 Order of lin Z,, ism,forrl = 7 = 0 if and only if m divides r. 
Example 4.5.10 Consider the multiplicative group Uo of prime residue classes mod- 
ulo 10, The order of 3 in Uj is 4, for3 41,3 = 91,3 = 27 =741, 
and3’ = 81 = T. 


Example 4.5.11 Order of each nonidentity elements of (Q, +) and (R, +) is infinite. 
Such groups are called torsion free. 
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Example 4.5.12 Order of each element of the group P of roots of unity is finite. Such 
groups are called torsion groups or periodic groups. Observe that P is infinite. 


Example 4.5.13 Order of some elements of S! (the circle group) is infinite and order 
of some elements of S! is finite. In fact, elements of finite order are those of P and 
those of S$! — P are of infinite orders. Such groups are called mixed groups. The 
subgroup P is the torsion part of S!. 


Proposition 4.5.14 J[f (G, 0) is a finite group, then order of each element of G is 
finite. 


Proof Leta € G. Then, the map n ~» a” can not be injective, for otherwise G would 
become infinite. tt 


Sometimes, we use notation | a | also for the order of an element of a of a group. 
Proposition 4.5.15 Let (G, 0) be a group anda € G. 


(i) If a is of infinite order, then the cyclic subgroup < a > generated by a is also 
infinite. 

(ii) Ifo(a) = mis finite, then the cyclic subgroup < a > generated by a is also 
finite and contains m elements. In fact, then 


2@s2 (ew ....;0"°) 


Proof (i) Ifa is of infinite order, then the map f from Z to <a >= {a" |n € Z} 
defined by f(n) = a” is a bijective homomorphism. Since Z is infinite, < a > 
is infinite. 

(ii) Suppose that o(a) = m. Then, a” = e ifand only ifm/n. Leta" €e<a>. 
By the division algorithm, there exist integers g andr such thatn = mq +r, 
where 0 < r < m. But, thena” = at" = (a™)40a" = a’ belongs to 
{e,a,a’,...,a™—'}. Thus, 


26> (4.82408) 


Observe further that all the elements of {e, a, a”,..., a’”—'} are distinct. tt 
The following corollary is immediate from the above proposition. 
Corollary 4.5.16 o(a) = o(<a>). tt 


Proposition 4.5.17 Let (G,o) be a group and a,b € G such that aob = boa. 
Suppose that o(a) = m, o(b) = n, andalso < a > (\ <b >= {e}. Then, 
o(aob) = [m,n] (the least common multiple of m and n). 


Proof Since aob = boa, (aob)’ = a’ob' forall r € Z. Thus, 


(aob) = eifand only ifa'ob’ = e 
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But, thena’ = b"' €<a>()<b>= {e}. Hence,a” = e = b™. Since 
o(a) = mando(b) = n,m andn both divide r. Hence, the least common multiple 
[m,n] divides r. Further, 


(aob)""") = qitnloplmal = eve =e 


This shows that order of aob = [m,n]. t 


Remark 4.5.18 If we drop the condition < a > {| < b > = {e} from the above 
proposition, then all that we can say is that o(aob) divides [m, n]. Further, if aob A 
boa, the o(aob) is highly indeterminate. It can be of any finite number, or it can be 
infinite also. 


Proposition 4.5.19 Let (G, 0) be a group. Leta € G and o(a) = n. Then, 


(i) <a >=<a" > 

(ii) o@") = 2. 
where (n, r) denotes the g.c.d of n and r. 

Proof (i) Letd = (n,r). Then, r = dq for some q. But, then a” = (a)? 
belongs to < a? >. Hence, < a” >C< a? >. Further, by the Euclidean 
algorithm, there exist u,v € Z such that un + vr = d. But, then 


at = qinter = (a")"o(a")” = (a’y" 


Hence, a? €< a" >, andso < a4 >C <a" >. This proves (i). 
(ii) From the Corollary 4.5.16, it follows that 


n 


(n,r) i 


o(a") = o(<a" >) = o(< a >) = oa) = 


Definition 4.5.20 A group (G, 0) is said to be cyclic if there is an element a € G 
such thatG =<a>. 


Thus, a group (G, 0) is cyclicifG = {a" |n € Z} forsomea eG. 


Example 4.5.21 The additive group (Z, +) of integers is cyclic. In fact, Z =< 1> 
=<-l>. 


Example 4.5.22 For each m > 0, the group (Zn, ®) is a cyclic group. In fact, 
Zm =< 1>. Also, Z, = <a > if and only if a and m are co-prime. 


Proposition 4.5.23 A group (G, 0) is cyclic if and only if it is homomorphic image 
of (Z, +). 


Proof Suppose that (G, 0) is cyclicand G = < a >. The map f from Z to G defined 
by f(n) = a” is surjective, for any element of G is of the form a” for some n € Z. 
By the law of exponents, it is also a homomorphism. Conversely, suppose that f is a 
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surjective homomorphism from the group (Z, +) to a group (G, 0). Let f(1) = a. 
Then, f(n) = a” forall n € Z. Since f is surjective,G = {a” | n € Z}. Thus, 
(G, o) is cyclic. tt 


Corollary 4.5.24 A homomorphic image of a cyclic group is a cyclic group. 


Proof Since composition of surjective homomorphisms is surjective homomor- 
phisms, it follows that homomorphic image of a homomorphic image of (Z, +) 
is ahomomorphic image of (Z, +). tt 


The following theorem says that essentially there are two types of cyclic groups: 
(i) (Z, +), and (ii) (Z,,, ®) for some m > 0. 


Theorem 4.5.25 Any two infinite cyclic groups are isomorphic. Any two finite groups 
of same orders are isomorphic. 


Proof Since the relation of isomorphism is an equivalence relation on any set of 
groups, it is sufficient to show that any infinite cyclic group is isomorphic to the 
group (Z, +), and any finite cyclic group of order m is isomorphic to (Zn, ®). Let 
(G, o) be an infinite cyclic group. Suppose that G =< a >. Then, a is of infinite 
order, for otherwise G will be a finite group (Proposition 4.5.15). Therefore, the 
map f from Z to G given by f(n) = a” is injective. It is already surjective 
homomorphism and so it is an isomorphism. 

Next, suppose that G = < a > contains m elements. Then, again by Proposition 
4.5.15, a is of order m. Suppose thatr = 5 in Z,,. Then, m/r — s. Since m is the 
order of a,a’~* = e,andhence,a” = a*. This ensures that we have a map f from 
Zm to G given by f(r) = a’. Clearly, f is a surjective homomorphism. Since both 
the groups contain the same number of elements, it is an isomorphism. tt 


The proofs of the following propositions are imitations of the arguments in Exam- 
ples 4.4.11 and 4.4.12. One can prove these propositions using these examples and 
the above theorem. However, we repeat the arguments again. 


Theorem 4.5.26 Every subgroup of a cyclic group is cyclic. 


Proof Let (G, 0) be a cyclic group and G =< a >. Let H be a subgroup of G. 
If H = {e}, then H =< e > is acyclic group generated by e. Suppose that 
H # {e}. Then, there exists an integer n #4 O such that a” € H. Since H isa 
subgroup, a” = (a")~! belongs to H. Hence, there exists 1 € N such that 
a' € H. By the well-ordering principle in N, we have the smallest m € N with the 
property that a” e€ H. We show that H =< a” >. Since H is a subgroup and 
a” é€ H, it follows that < a” >C H. Let a” ¢€ H. By the division algorithm, 
there exist q,r € Zsuchthatn = mq + r,whereO <r < m. But, then 
a” = amar = (a™)10a" = a’. Thus, a’ € H. Since m is the smallest positive 
integer such that a” € H, it follows thatr = 0. This shows that a” = (a’”)4 
belongs to < a” >. Hence, H =<a™ >. ft 
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Theorem 4.5.27 Let (G, 0) be an infinite cyclic group generated by a. Then, < 


a” > =<" > ifand only ifm = +n. In particular < a" > = G if and only if 
m= +1. 

Proof Clearly, < a” > =< a~™” >. Suppose that < a” > =< a” >. Then, 
there exist r,s € Zsuch thata” = (a")’ anda” = (a’)’. Since a is of infinite 
order,m = nrandn = ms. This shows thatrs = 1l,andsor = +1. Hence, 
m = +n, tt 


Corollary 4.5.28 There are infinitely many proper subgroups of an infinite cyclic 
group. 


Proof It follows from the above theorem that the map f from N (J {0} to the set S(G) 
of all subgroups of G given by f(n) = < a” > is bijective. tt 


Proposition 4.5.29 Let (G, 0) be a finite cyclic group of order m generated by a. 


Then, < a’ > =<. a’* > ifand only if (m,r) = (m, s) (equivalently lui] = Wms] ) 
Proof Since the order of a is m the order of G, the result follows from Theorem 
4.5.19. (Note that (m,r) = ial and (m,s) = mal .) tt 
Corollary 4.5.30 Let (G, 0) be a cyclic group of order m generated by a. Let d, 
and dy be divisors of m. Then < a” > = < a® > if and only if d; = +db. 

Proof Follows from the fact that d is a divisor of m if and only if (m,d) = +d. 


Corollary 4.5.31 Let (G,0) be a cyclic group of order m. Then the number of 
subgroups of G is T(m), where T is the divisor function (T(m) is the number of 
positive divisors of m). 


Proof Let (G, o) be the cyclic group generated by a. Then, order of a is also m. Let 
D denote the set of positive divisors of m. Then, from the above results, it follows 
that the map f from D to the set $(G) of subgroups of G defined by f(d) = < a4 > 
is bijective. tt 


Corollary 4.5.32 Corresponding to every divisor of the order of a cyclic group there 
is a unique subgroup of that order. 


Proof Let (G,o) be a cyclic group of order m generated by a. Let r/m. Then, 
<a‘ > is the unique subgroup of order r (note that if H is a subgroup of G, then 
there exists a unique positive divisor d of m such that H =< a‘ >). Further, the 


order of < a@ > is - tt 


Corollary 4.5.33 Let (G, 0) be acyclic group of order m generated by a. Then there 
are ¢(m) singleton generators of G, where is the Euler’s phi function. 


Proof We have already seen that < a’ > = < a”) > and also if d; and d) are 
two positive divisors of m such that < a! > =< a” >, then d; = d. Thus, 
<a’ >=<a>ifandonlyif(m,r) = 1.SinceG = {f{e,a, a, ..., a"), it 
follows that there are as many singleton generators of G as many positive integers 
less than m and co-prime to m. Thus, there are ¢(m) singleton generators of G. ff 
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Proposition 4.5.34 Let (G, 0) be a finite group such that the subgroups of G form 
a chain in the sense that for any pair H, K of subgroups of G, H C K or K C H. 
Then, G is a cyclic group of order p” for some prime p andn > 0. 


Proof Since G is finite, order of each element of G is finite. Let a be an element of 
G of highest order. We show thatG =< a >. Let b € G. Since subgroups of G 
form a chain, < b >C<a>or<a>C< b>. But < b > cannot contain < a > 
properly, for otherwise o(b) > o(a). Thus, < b >C<a>andsob €<a >. This 
shows that G is cyclic. Next, suppose that o(G) = o(a) = mis nota power of a 
prime. Then, m = mj mp for some m,,m with 1 < m; < mandl < mz < m 
and (m,,m2) = 1. But, then < a” > = f{e,a™,a7™,...,a%™-)"™} does not 
contain a’, and similarly, < a”? > does not contain a”!. This is contradiction to 
the hypothesis. ft 


Corollary 4.5.35 Let (G, 0) be a finite group such that union of any two subgroup 
of G is a subgroup. Then G is cyclic. tt 


Remark 4.5.36 The above proposition and the corollary are not true if we do not 
assume G to be finite. For example, let p be a prime number andG = {zeEC| 
z?" = 1 forsomen € N}. Then, G isa group with respect to complex multiplication 
which is not cyclic, but the family of subgroups of G form a chain (verify). 


An Application To Number Theory 


Theorem 4.5.37 Letn € N. Then, 


LajnP(d) = 1, 
where @ is the Euler ‘s phi function. 


Proof Let (G, 0) be a cyclic group of order n generated by a. Let X,y be the set of 
elements of order d of G. Since order of each element of G is a divisor (prove it) of 
n, Xq = , if d does not divide n. Also, 


c= d/n Xa 
Further, if r and s are distinct divisors of n, then X, (|X; = @. Hence, 0(G) = 
Xa/n0(Xq). Now, corresponding to each divisor d of n, there is a unique subgroup 
< b > of order d, where b = a7. Since each element of G of order d will generate 
a subgroup of order d, all the elements of Xq are elements of < b > which generate 
<b>.Thus, X¢ = {b" | 1<r-<nand (d,r) = 1}. Hence, o(X4) = ¢(d). 
This shows thatn = YgjnG(d). tt 
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Exercises 


4.5.1 Find the order of 3 in U;,. Show that Uj, is a cyclic group of order 10. 
4.5.2 Find the order of 8 in Z¢o. 


4.5.3 Find the number of subgroups of a cyclic group of order 200. Enumerate the 
subgroups of Zy09. 


4.5.4 Give an example of an infinite group all of whose elements are of finite order. 
4.5.5 Show that every cyclic group is abelian. 

4.5.6 Give an example of an abelian group which is not cyclic. 

4.5.7 Find the subgroup of Z generated by {20, 45, 50}. 


4.5.8 Let m € N. Suppose that (10,m) = 1. Show that r is the order of 10 in U,, 
if and only if the decimal representation of + recurs at the rth step. 


4.5.9 Find the subgroup of Zio generated by {4, 6, 18}. 


4.5.10 Suppose that o(a) = nando(b) = m. Suppose thata” = b~". Show that 
[n,r] = [m,r]. 


4.5.11 Find the number of singleton generators of Z3o9. 


4.5.12 Find the number of homomorphisms from a cyclic group of order 12 to a 
cyclic group of order 15. Also find them. 


4.5.13 Find the number of homomorphisms from a cyclic group of order 16 to the 
circle group S! and also find them. 


4.5.14 Give an example of a nonabelian group all of whose proper subgroups are 
cyclic. 
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4.5.15 Show that (Q, +), (R, +), and (C, +) are not cyclic groups. 


4.5.16 Show that every proper closed subgroup of (IR, +) is a cyclic group. 


4.5.17 Show that if all proper subgroups of a nonabelian group G are abelian, then 
G is generated by two elements. 


4.5.18 Show that all proper closed subgroups of the circle group S! are finite cyclic. 


4.5.19 Characterize a group all of whose proper nontrivial subgroups are isomor- 
phic to the group itself. 


4.5.20 Let (G, 0) be a group and a,b € G. Suppose that aob = boa o(a) = 
100 o(b) = 60. Suppose that a'° = b°. Find the order of aob if possible. 


4.5.21 Suppose that o(a) = 24, o(b) = 15, and aob = _ boa and also 
<a>{|\<b>= {e}. Find the order of aob. 


4.5.22 Suppose that o(a) = 18ando(b) = 25andab = ba. Find the order of aob. 


4.5.23 Derive all the results about cyclic groups using Examples 4.4.12 and 4.4.13 
and Theorem 4.5.25. 


4.5.24 Let f be a homomorphism from a group (G, 0) to a group (G’, 0’). Let 
a € G be an element of finite order. Show that f(a) is also of finite order and 
o(f (a)) divides o(a). 


4.5.25 Let (G, 0) be a group and a, b € G — {e}. Let p bea prime such that 2? — | 
is also a prime. Suppose that a? = {e} and aoboa~! = b?. Find the o(a) and o(b). 


4.5.26 Let (G, 0) bea group anda, b € G—{e}. Let p be aprime such thata’ = e 
and aoboa~! = b>. Suppose that 3? — 1 = 2q, where q is a prime. Suppose that 
aob # boa. Show that o(a) = p ando(b) is g or 3? — 1. 
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4.5.27 Let (G, 0) be a finite group of even order. Suppose that half of the elements 
of G are of order 2 and the rest of the elements of G form a subgroupH. Show that 
His an abelian subgroup of odd order. 

Hint. The number of elements of order 2 is odd. Thus, o(G) = 2n, where n is odd. 
Let b be an element of order 2 andc € H. Then, bc ¢ H. Hence, bocob = co! for 
all c € H. Deduce that c~!od~! = d@'oc™! forallc,d € H. 


4.5.28 Let (G, 0) be a group and a,b € G such that aob = boa. Suppose that 
o(a) = mando(b) = n. Show that G has an element of order [m, n]. 


4.5.29 (G, 0) be a finite abelian group. Let m be maximum among the orders of 
elements of G. Use the above exercise to show that a” = e foralla eG. 


4.5.30 Show that the semigroup End(G, o) of endomorphisms of an infinite cyclic 
group (G, 0) is isomorphic to the multiplicative semigroup Z of integers. Show 
further that the group Aut(G, 0) of automorphisms of (G, 0) is isomorphic to the 
multiplicative group {1, —1}. 


4.5.31 Let (G, 0) be acyclic group of order m. Show that the semigroup End(G, o) 
of endomorphisms of (G, 0) is isomorphic to the semigroup (Z,,, *). Show further 
that the group Aut(G, o) of automorphisms of (G, 0) is isomorphic to the group U,, 
of prime residue classes modulo m. 


Chapter 5 
Fundamental Theorems 


This chapter is devoted to some fundamental theorems such as Lagrange Theorem 
and Isomorphism Theorems. We also discuss the direct decomposition of groups into 
indecomposable groups. 


5.1 Coset Decomposition, Lagrange Theorem 


In this section, we partition a group into disjoint union of sets with the help of 
a subgroup such that each member of the partition contains the same number of 
elements as the subgroup, and thereby deduce that the order of a subgroup divides 
the order of the group. We also give some applications of this fundamental result, 
especially in number theory. 

Unless specified otherwise, a binary operation of a group will be denoted by 
juxtaposition. Instead of saying that (G, 0) is a group, we will simply say that G is 
a group. 

Let H be a subgroup of a group G. Define relations R! and R” on G as follows: 


R' = {(a,b)€GxG|a'be H}, 


and 
R" = {(a,b)€ Gx G|ab™ € H}. 


Here, / stands for left and r stands for right. 


Proposition 5.1.1 R! and R" are equivalence relations with G. 


Proof Reflexive: Since H is asubgroup x-!x = e belongs to H for all x € G. This 
shows that (x, x) € R! for all x € G. 
Symmetric: Suppose that (x, y) € R!. Then, x~!y € H. Since H is a subgroup, 
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(x~!y)-! = y7!x belongs to H. Hence, (y, x) € R!. 

Transitive: Suppose that (x, y) € R! and (y, z) € R!. Then, x~!y and y~!z both 
belong to H. Since H is a subgroup, x~!'z = x~!yy~!z belongs to H. This shows 
that (x, z) € R’. Similarly, we can prove that R” is an equivalence relation. tt 


Example 5.1.2 If H = {e}, then x~'y ¢ H means that x = y. Hence, in this case 
R' = R’ is the diagonal relation A with G. 


Example 5.1.3 The relations R’ and R" in general may be different. For example, 
consider the symmetric group $3 = Sym(X), where X = {1, 2,3} of degree 3. 
Let H = {I, p}, where J is the identity map on X, and p is the map defined by 
pC) = 2, p(2) = I,and p(3) = 3. Clearly, H is a subgroup of $3 (pop = I). 
Check that (f, g) € R’, where f and gare given by f(1) = 3, f(2) = 1, f() = 
2, g) = 1, g(2) = 3,and g(3) = 2. Also check that (f, g) does not belong to 
R’. 


Example 5.1.4 Let H be a subgroup of a group G contained in the center of G. 
Let (x, y) € R!. Then, x~'y € H. Since H is contained in the center, x-!y = 
x(x~!y)x7-! = yx7!. This means that (y,x) € R’. Since R’ is an equivalence 
relation, (x, y) € R’. Thus, R'C R’. Similarly, R” C R!. It follows that R! = R’. 


For a € G, the equivalence class R! of G modulo the equivalence relation R’ is 
given by 


Ri = {beG|(a,b)€ R'} = {hE Gl|a'be H} 


={beG|b = ah forsomeh € H}={ah |he H}. 


Definition 5.1.5 The set {ah | h € H} is denoted by aH and is called the left coset 
of G modulo H determined by a. The set {a | h € H} denoted by Ha is called 
the right coset of G modulo H determined by a. 


Thus, R! = aH is the left coset of G modulo H determined by a, and R’ = Ha 
is the right coset of G modulo H determined by a. Following the properties of left 
and right cosets can be easily verified. Indeed, these properties are also a direct 
consequence of Proposition 2.3.4. 

@) Liege = G. 

(ii) aH = DH ifand only ifa~'b € H. 

(iti) aH 4 bH ifand only ifaH (\bH = @. 

Also, 

(1) Uses Ha = G. 

(ii) Ha = Hb ifand only ifab™ € H. 

(iti) Ha £ Hb if and only if Ha(\ Hb = @. 

The quotient set G/R' = {aH | a € G} is the set of left cosets of G modulo 
H. This set is denoted by G/'H. Similarly, G/R’ = {Ha|a€ G} is the set of 
right cosets of G modulo H and is denoted by G/’ H. Recall that if R and S are 
equivalence relations on X, then 
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(i) R = Sifand only if Ry = S, forallx € X. 
(ii) R = Sifand only if X/R = X/S. 

Thus, 

(i) R' = R’ ifand only ifaH = HaforallaeéG. 
(ii) R'! = R" ifand only ifG/'H = G/'H. 


Proposition 5.1.6 There is a bijective map from G/'H to G/"H. 


Proof a~'b = a~'(b~')~!. Thus, to say that a~'b € H isto say thata~'(b7!)~! € 
H. Hence, aH = bH if and only if Ha~' = Hb™'. This shows that we have 
an injective map f from G/'H to G/’H given by f(aH) = Ha. Also any 
Ha = f(a"'A). Hence, f is surjective. t 


Remark 5.1.7 The correspondence aH ~» Ha need not be a map from G/!H to 
G/' H. Consider Example 5.1.3. It can be checked that fH = gH but Hf #4 Hg. 


Corollary 5.1.8 G/'H is finite if and only if G/" H is finite, and then they contain 
the same number of elements. tt 


Definition 5.1.9 Let H bea subgroup of a group G. If G/' H (or equivalently G/" H) 
is infinite, then we say that H is of infinite index in G. If G/'H is finite, then the 
number of elements in G/! H (which is the same as the number of elements in G/’ H) 
is called the index of H in G. The index of H in G is denoted by [G : H]. 


Thus, [G : H] is the number of left cosets of G modulo H(which is same as 
number of right cosets of G modulo #7). 


Example 5.1.10 Let m € N. Then, mZ is a subgroup of the additive group Z of 
integers. Since Z is abelian, every left coset of Z modulo mZ is also a right coset. 
Thus, Z/‘mZ = Z/"mZ). The left coset a + mZ of Z modulo mZ determined by a is 
givenbya+mZ = {a+mr|reéZ} = {bE Z|m/a—b}.Thus,a+mZ =a 
in Z,,. This means that Z/'mZ = Z, = Z/"mZ and the index [Z : mZ] of mZ in 
Zism. 


Example 5.1.11 Consider the Klein’s four group V4 and the subgroup H = {e,a} 
of V4. Then,eH = {e,a},aH = H = Ha (note thataH = A if and only if 
aéH), Also, bH = {b,c} = cH = Hb = He. Thus, V4/'H = Vi/'H = 
{{e, a}, {b, c}} and the index [V4 : H] = 2. 


Example 5.1.12 Consider the Quaternion group Qs and the subgroup H = {1, —1} 
of Qs. It is easy to observe that O3/'H = Qs/'H = {{1, -]}, {i, -7, (7, —J}, 
{k, —k}}, and so the index [Qg, H] of H in Qg is 4. Further, if K = {1, —1,i, —i}, 
then O3//K = Qs/’K = {{1,—1,i, —i}, {j, —j, k, —k}}, and so[Qx3 : K] = 2. 
Similarly, we observe that all other nontrivial proper subgroups of Qs are of index 2. 


Theorem 5.1.13 (Lagrange) Let H be a subgroup of a finite group G. Then, 


|G|=|H|[G: A] 
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Proof Since G is finite, the set G/' H of left cosets of G modulo H is finite. Suppose 
that | G/'H |= [G, H] = r.Let 


G/'H = {a,H = H,aH,a3H,...,a,H} 


Since the union of left cosets of G modulo H is G, we have 


G = aH JoH\|)---UaH (5.1) 


Since distinct left cosets are disjoint, a;H ()a;H = Q@ for all i £ j. Thus, by 
counting the elements of G in the above equation we get 


[Gl=|aH|+ |mH|+---+ |aH| (5.2) 


Next, let aH be any left coset of G modulo H. Then, the map h ~» ah from H to 
aH is clearly surjective (clear from the definition of aH), and it is also injective, for 
by the cancellation law, ah; = ahz implies thath; = hz. This shows that any left 
coset contains the same number of elements as H. Thus, from the above equation it 
follows that 


|G|=|H|r =| |[G, 4] q 


Corollary 5.1.14 Order of a subgroup (index of a subgroup) H of a finite group G 
divides the order of the group, and 
| G | 


CH= 
[ ] | i 


Remark 5.1.15 From Lagrange theorem, order of a subgroup divides the order of 
the group. Naturally, we have the following problem: Let G be a finite group and m 
divides the order of the group. Do we have a subgroup of G of order m? We shall 
see later that G need not have any subgroup of order m. If corresponding to every 
divisor of the order of a group G there is a subgroup of that order, then G is called a 
C.L.T group. 


Corollary 5.1.16 Let G be a group and | G | = n. Leta € G. Then, o(a) divides n 
(note that notation o(G) also stands for order of G.). 


Proof Since o(a) = o(< a >), the result follows from the Lagrange theorem. { 
Corollary 5.1.17 Let G be a group of order n anda € G. Then, a" = e 
Proof From the above corollary, o(a) divides n. The result follows. tt 


Corollary 5.1.18 Every group of prime order is cyclic. Indeed, in a group of prime 
order, every nonidentity element generates the group. 
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Proof Let G bea group and | G |= p, where p isa prime. Leta € G, a # e. Then, 
<a> {e}, and so o(<a>) > 1. By the Lagrange theorem, o(< a >) divides 
p. Hence, o(< a>) = p. But, thenG =< a >is acyclic group. tt 


The following theorem characterizes the cyclic groups of prime orders. 


Theorem 5.1.19 A group G has no nontrivial proper subgroups if and only if it is a 
cyclic group of prime order. 


Proof Suppose that| G |= p, where p isa prime. Let H ¢ {e} be a subgroup of G. 
Then, | H |> 1.Bythe Lagrange theorem,| H | / | G |. Hence,| H |= p =| G|, 
and so H = G. Conversely, suppose that G has no nontrivial proper subgroups. 
Leta € G, a #e. Then, < a > is a nontrivial subgroup of G. By our supposition, 
<a>z= G.Thus, G is cyclic. If G is infinite cyclic group, then by Corollary 4.5.28, 
it has infinitely many nontrivial proper subgroups. Hence, G = < a > is finite cyclic 
group. Suppose that | G | = m is not prime. Then, m = r-s for somer,s,1 < 
r<m,1< s < m.Thenclearly < a’ > is a subgroup of G of order s, and so 
it is a proper nontrivial subgroup of G. This is a contradiction to the supposition. 
Hence, m is prime. tt 


Theorem 5.1.20 (Poincare Theorem) Intersection of a family of finitely many sub- 
groups of finite indexes is again a subgroup of finite index. 


Proof Itis sufficient to show that the intersection of two subgroups of finite indexes is 
a subgroup of finite index. Let H and K be subgroups of G of finite indexes. Suppose 
thata(H () K) = b(H() K).Thena'b ¢ H() K.Inturn,aH = bHandaK = 
bK. Thus, (aH,aK) = (bH,bK). This gives us a map 7 from G/'(H (| K) to 
G/'H x G/'K defined by n(a(H () K)) = (aH, aK). Suppose that (aH, aK) = 
(bH,bK). Then, aH = bH andaK = bK. But, then a~'b € H()K. Hence, 
a(H ()\ K) = b(H()K). This shows that 77 is injective. Thus, if G/!H and G/'K 
are finite, then G/'(H () K) is also finite, and 


| G/'(H (| K) |s| G/'H|-| G/'K | 
This proves that if [G, H] < co and[G, K] < o, then 

[G, H( | K]<IG, H]-[G, K] < 0. 
Proposition 5.1.21 Let H and K be subgroups of a group G. Then, there is a 
bijective map from H/'(H (| K) to HK/'K = {aK |a € HK}. 


Proof Suppose that a(H()K) = b(H{()K), where a,b € H. Then, a-'be 
H () K. In particular, a-'b € K,andsoaK = bK. This shows the existence of 
a map 7 from H/'(H(\ K) to HK/'K defined by n(a(H ()\K)) = aK. Any 
element of HK/'K is of the form hkK for some h € H and k € K. Clearly, 
hkK = hK = n(h(H()XK)). This shows that 7 is surjective. Suppose that 
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n(a(H ()K)) = n(b(H()K)). Then, aK = bK. Hence, a~'be€ K. Since 
a,b € H, it follows that a~'b € H() K. Hence, a(H (\ K) = b(H()K). This 
shows that 7) is bijective. tt 


Corollary 5.1.22 Let H and K be finite subgroups of a group G. Then, H K is also 


finite, and 
| H|-| K | 


ae ta 
| H() K | 


Proof The map 7 from H x K to HK defined by n((h,k)) = hk is surjective. 
Hence, if H and K are finite, then HK is also finite. Since x € HK implies that 
xK C HK, HK isacomplete union of members of HK/'K. Thus, 


HK = U cnet: 


Since distinct left cosets are disjoint 
| HK |=|K|-|HK/'L | 


From the previous proposition and the Lagrange theorem, 


| H | 
|HK/'K|=|A/'H()K\|= [4,4()K] = ——— 
( ( | H()K | 
Thus, 
. [Kite 
(ae |= 
| H()K | 


Applications to Number Theory 


Theorem 5.1.23 (Euler—-Fermat) Let m be a positive integer and a an integer such 
that (a,m) = 1. Then 
a?) = 1(mod m). 


Equivalently, 
qm = T 


in Zin 


Proof Consider the group U,, of prime residue classes modulo m. Since (a,m) = 1, 
a €Uy,. Since | Un | = (m), by Corollary 5.1.17, a” = T,or equivalently, 
a®™) — | is divisible by m. This means that a?” = 1(mod m). t 


Remark 5.1.24 The above result asserts that if we divide a®”” by m, the remainder 
obtained is 1. 
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Corollary 5.1.25 (Fermat) Let p be a prime number which does not divide an integer 
a. Then 
a?~| = (mod p). 


, pe ans 
Equivalently, @?~" = 1in Zp. 


Proof The result follows from the above Euler—Fermat’s theorem, if we note that 
0(p) = p — 1 forall prime p (note that all positive integers less than p are co-prime 


to p). t 
Corollary 5.1.26 Let p be a prime number and a an integer. Then, a? = a(mod p). 


Proof If p does not divide a, then from the above Fermat’s theorem a?~! = 
1(mod p), and so a? = a(mod p). Next, if p divides a, then it also divides a? — a, 
and therefore, again a? = a(mod p). tt 


More generally, we have the following corollaries. 


Corollary 5.1.27 Let p be a prime number and it does not divide an integer a. Then 
i l(mod p’). tt 
Corollary 5.1.28 Jf p is a prime which does not divide a, then 


a’ = a” ' (mod p’). tt 


Illustrations 


1.1. Let G be a group, and H and K be subgroups of G such thataH = bK for 
some a,beG.Then,H = K. 


Proof Suppose thataH = bK.Then,a¢@aH = bK.Hence,a € bK. But, then 
ak = bK = aH.Sinceak = ah implies thatk = h,weseethatK = H. { 


1.2. LetG =< a> beacyclic group (not necessarily finite) and H a subgroup of 
index m. Then, H =<a"™ >. 


Proof If H = {e}, then m = [G, {e}] =|G|andsoa” = eandH =< 
a” >. Suppose that H # {e} and that H =<a' >, where r is the least posi- 
tive integer such that a” € H. Then, it is easy to show that a” € H if and only 


if r divides n. This means that a’'H = a‘H if and only if r divides s —tf. 
This shows that {eH,aH, aH,.. .,a’ | Hyis precisely the set (G/H)'. Hence, 
m = [G,H] =r. tt 


1.3. Let G; and G2 be groups of co-prime orders. Then, the only homomorphism 
from G, to G2 is the trivial homomorphism. 
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Proof Let f be a homomorphism from G; to G2. Let a € G;. Then, | a | divides 
| G; | and| f(a) | divides | G2 |. Also| f(a) | divides | a |. Hence, | f(a) | divides 
| G, | and | G2 |. Since | G; | and | G2 | are co-prime, it follows that | f(a) |= 1 
and so f(a) = e foralla € Gj. tt 


1.4. Let G be a finite group. Suppose that for each divisor d of | G |, the equation 
X? = ehasat most d solutions. Then, G is cyclic. 


Proof Suppose that | G | = n. Let d be a divisor of n. If there exists an element a 
of order d, then all elements of < a > are the solutions of the equation X’ = e. By 
our hypothesis, there cannot be more. In < a >, there are ¢(d) elements of order d. 
This shows that either G contains ¢(d) elements of order d or it contains no element 
of order d. Let A = {d EN | d divides n and G has an element of order d}. 
Since each element of G is of order a divisor of n, we have 


n=|Gl= >), ¢@) 


Since, 


n= Vin 


(by Theorem 4.5.37), it follows that A is the set of all divisors of n. In other words, 
corresponding to every divisor d of n, there is an element of that order. In particular, 
G has an element of order n. This proves that G is cyclic. tt 


1.5. Let G be a group of order p’ - g*, where p and q are distinct primes. Let P be 
a subgroup of order p’ and Q a subgroup of order g*. ThenG = P.-Q. 


Proof By Corollary5.1.22,| P-Q|= Pral , and since P () Q is a subgroup of 
P as well as Q, | P (\ Q | divides | P | as well as | Q |. Since p and q are distinct 


primes, | P()Q|= 1.Thus,| P-Q|= p'q’ =|Gl,andsoG = P-Q. ¢ 


1.6. Let G bea finite group and f an automorphism of G.LetX = {aeG| f(a) = 
a~'}. Suppose that | X | > 3 | G |. Then, Gis abelianand f(a) = a7! foralla € G. 


Proof Let b € X. We show that the centralizer Cg(b) of b is the group G itself. 
Consider b-'X = {b~'a|a € X}. Since the multiplication by any element of G is 
a bijective map from G to G, | b-'X | =| X |. Now, 


|b'X| + |X| - |b'X()xX|=lo'xU Xx ls G| 
Since | b-1X | =| X |> 3-| G|, 


| G | 


bX ( XS eo Xk X| — |G|>— 
| () x l= pee = eee 
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Let z € b-!X () X. Then, z, bz € X. Hence, z-'b-! = (bz)! = f (bz) (forbz€ 
X) = f(b)f(z) (for f is a homomorphism) = b-1z-! (for b, z € X). Tak- 
ing inverses bz = zb. Thus, each element of b~'X () X commutes with b. In 


other words, b-!X (| X C Cg(b), the centralizer of b. Since | b>! X () X | > aon 
| Cg(b) | > & and so by the Lagrange theorem Cg(b) = G. Thus, every element 
of X commutes with every element of G andso X C Z(G). Again, | Z(G) |=>| X |= 
3G and hence by the Lagrange theorem Z(G) = G. This shows that G is abelian. 
Further, then, X forms a subgroup(check it). Again, since | X | > ; | G |, by the 


Lagrange theorem X = G. tt 


Remark 5.1.29 The result in the above illustration is best possible. For example, 
consider Qg the Quaternion group of order 8. The map f from Qs to Qs given by 
f@ = 1, fC) = -1, f@M = - FO) = -4, FCO = 4 FC) = 
J, ftk) = k, f(—k) = —k is an automorphism for which | X | = 3 | Qs |, but 
Qs is not abelian. 


1.7. Let a be an integer greater than 1. Then, n divides p(a” — 1) for all n > 1, 
where ¢ is the Euler’s phi function. 


Proof Consider the group U,:_; of prime residue classes modulo a” — 1. Then, 


| Ugn_y | = O(a" — 1). Since a > 1 andn>1, (a,a"—1) = 1. Hence, dé 
Ugn_,. Also a" = 1, for a” — 1 divides a” — 1. Further, a” — 1 is not divisible by 
a" —1foranym < n. Hence, the order of @ in Ugn_, is n. The result follows from 
Corollary 5.1.16. tt 


1.8. The remainder obtained when 42°! + 187! + 7 is divided by 31 is 5, for using 
Fermat’s theorem, we have 


421 418947 = 22" 47847 = @)?-D+184+7 = M435 =5 


Exercises 


5.1.1 Let R be a relation on a group G such that R is a subgroup of G x G (recall 
that G x G is a group with respect to coordinate-wise operation). Show that R, = 
{x € G| (x, e) € R} is a subgroup of G. Let R and S be subgroups of G such that 
R. = S.. Can we conclude that R = S$? Let R be a reflexive relation on G which 
is a subgroup of G x G. Show that xhx~! € R, forall x € Gandh € R,. Let R be 
a symmetric relation on G which is a subgroup of G x G. Show that if x € R,, then 
x? € R.. Show that a map f from G to itself considered as a subset of G x G isa 
subgroup if and only if f is an endomorphism of G. 


5.1.2 Let G be a group of order 80. Show that it does not contain any subgroup of 
order 3. 


5.1.3, Give an example of an infinite group in which all nontrivial subgroups are of 
finite indexes. 
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5.1.4 Give an example of an infinite group in which all proper subgroups are of 
finite orders. 


5.1.5 Let H and K be subgroups of a finite group G such that H C K. Show that 
[G, H] = [G, K][K, H]. 


5.1.6 Suppose that [G, H] = m and [G, K] = n. Show that [m,n] divides 
[G, H (| K]. When does equality hold? 


5.1.7 Let f be a homomorphism from a finite group G, to a finite group G2. Show 
that | f(G1) | divides | G2 | and | ker f | divides | G, |. 


5.1.8 Show that every finite cyclic group isa C.L.T group. 


5.1.9 Let S be a nonempty subset of a group G. Define a relation ~ on G by 
a ~ bifand only ifa~'b € S. Show that ~ is an equivalence relation if and only if 
S is a subgroup. 


5.1.10 Let H be a subgroup of G. Show that R’ is a subgroup of G x G if and only 
if R' = R’ (equivalently,aH = Ha foralla€G). 


5.1.11 Let H and K be subgroups of a finite group G. Suppose that | H |> | G | 
and| K |> ./| G |. Show that H () K # fe}. 


5.1.12 Suppose that | G |= p-q, where p and q are distinct primes and p > q. 
Use the above exercise to show that there can be at most one subgroup of G of order 


p. 


5.1.13 Show that (a+ b)? = (a? + b?)(mod p) for all a,b € N, where p is a 
prime number. Hint: Use the Binomial theorem. 


5.1.14 Let H and K be subgroups of G. Show that a(H (| K) = aH (\aK for 
alla €G. 


5.1.15 Suppose that [G, H] and [G, K] are finite and co-prime. Show that 
[G,H(\K] = [G,H]-[G, K]. Suppose further that G is finite. Show that 
G = HK. 


5.1.16 Let G be a group having just one proper nontrivial subgroup. Show that G 
is a cyclic group of order p? for some prime p. 


5.1.17 Find the remainders when 
(i) 28!° + 5 is divided by 17. 
(ii) 1474 + 5 is divided by 13. 
(iii) (427° + 6)!° is divided by 11. 
(iv) 3*7 is divided by 23. 
(v) (18! + 7)°6 + 2 is divided by 19. 


5.1.18 Show that (10! + 5) is of the form 1+ 11k for some k > 0. 
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5.1.19 Show by means of an example that a”~! = 1(mod m) need not imply that 
m is prime. 


5.1.20 Find #(1000). 


5.1.21 Let G be a group and H a subgroup of G. A subset S$ obtained by selecting 
one and only one member from each right(left) coset of G modulo H with the choice 
e from the coset H is called a right(left) transversal of G modulo H. Let H bea 
subgroup of K and K a subgroup of G. Let T be a right transversal of K in G and 
S aright transversal of H in K. Show that S - T is a right transversal to H in G. 


5.1.22 Let G be a finite group and H a subgroup of order m and index r. Find the 
number of right transversals to H in G. 


5.2 Product of Groups and Quotient Groups 


Let G;, Go,..., G, be groupsandG = G; x G2 x --: x G,, the Cartesian product 
of the sets G;, Go, ..., G,. Define a binary operation * on G by 


axb = (a,b, dobo,..., a,b,), 


wherea = (dj, d2,...,@,) andb = (by, bo,..., by). The element (e), 2, ..., @n) 
is the identity, and (ar lin eas a;,') is the inverse of (a1, d2,..., Gy). This group 
is called the Cartesian product or the external direct product of G,, G2,..., Gn, 


and it is denoted by [| |/_, G;. 


Theorem 5.2.1 LetG = Ty G;, be the external direct product of G,, G2, ..., Gn. 
Let 


Ay = {(€1, €2,---, Ck-1,4, Ck41,---,€n) | a € Ge}, K=1,2,...,0n. 


Then, the following holds: 
(i) Hy is a subgroup of G, for all k. 
(ii) Hy is isomorphic to G, for all k. 
(ili) Every element of Hy, commutes with every element of H; for allk # 1. 
(iv) Every element g € G has a unique representation as 


g = hyhg-+-hy, hy © Hy. 


(v) (iyhy +++ hy) = (hy) = yh hgh Ral, ; 


The proof of the above theorem is straight forward, and it is left as an exercise. The 
map 7% from G_; to Ay given by m(a) = (e1, €2,.--, Ck—-1, G, Ck+1, +++, En) is the 
required isomorphism. 
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Definition 5.2.2 A group G is said to be the internal direct product of its subgroups 
A, Ho, ..., Hy, if 

(i) each element of H;, commutes with each element of H; for all k #1, and 

(ii) every element g of G can be expressed uniquely as 


j= hyhz---hy, hy € Ay. 
Thus, the external direct product G = [Jj_, G; is the internal direct product 


of its subgroups H), Ho, ..., H,, described in the previous theorem. Conversely, we 
have the following proposition. 


H 
i 


Proposition 5.2.3 If G is the internal direct product of its subgroups H\, H2,... 
H,, then G is isomorphic to the external direct product []}_, Hi. 


Proof The map 7 from dem H; to G defined by n((Ay, h2,...,An)) = Ayho---hy 
is easily be seen to be an isomorphism. ft 


Proposition 5.2.4 Let G be a group. Then, G is the internal direct product of its 
subgroups H,, H2,..., H, if and only if the following holds. 

(i)G = H,Hy--- Hy. 

(ii) hh, = hyh, for allh, € Hy and h; € Hi, k # l. 

(iii) Ay (A Ae - +» Ap—1 Ae Aga) +++ An) = {e} for all k. 


Proof Suppose that G is the internal direct product of its subgroups H,, Hp, ..., Ay. 
The conditions (i) and (ii) follow from the definition of the internal direct product. 
To prove (iii), let g € Ay (\(A Ap: + Ay—-1 Hei +++ An). Then, g = e1e@2-+- ex-1 
ger+1°** en, Where eache; = e the identity of G. Again, since g € (H, Ho--+ Ak 
Agai:++ An), g = hyho-++hp_yehy41-+-+-hy. By the uniqueness of the representa- 
tion, g = e. This proves the condition (iii). 

Conversely, assume that the conditions (i), (ii), and (iii) of the proposition hold. 
By (i), every element g of G can be expressed as 


g = Ayho--++hy, hy € A; 
For the uniqueness of the representation, suppose that 
g = hyhz-++hy = hh. P,, hy, hy € A. 


From (ii), it follows that H, H2--- Ay_|Ay.,---H, is a subgroup of G for all k. 
Again, by (ii), 

ny hi, = (hyhz+ ++ hp ieg Fn) (A Ay Meh WT. 
This shows that h,'h, belongs to A, (.) H, H2--+ Ay, Ay, ---H, for all k. By the 
condition (iii) of the proposition, hy hy, = e forall k. Hence, hy = h’, for alli. 
This proves the uniqueness of the representation. tt 
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In view of the above discussion, we need not to distinguish internal and external 
direct products. 


Proposition 5.2.5 The direct product of any two nontrivial cyclic groups is cyclic if 
and only if they are finite and of co-prime orders. 


Proof Let G be a group which is a direct product of a nontrivial cyclic group H = 
<a > and a nontrivial cyclic group K = < b >. G cannot be infinite cyclic group. 
For, if G =< c > is infinite cyclic, thena = c' andb = c” for some! 4 0 and 
m # 0. But, then c™ EH () K. Since G, and so c is of infinite order cin 4 e. This 
contradicts the supposition that G is a direct product of H and K. Thus, if the direct 
product of H and K is cyclic, then both H and K should be finite. Now suppose that 
| H|=|a|=rand| K |=|b|= s.Then, G is cyclic if and only if | a'b/ |= rs 
for some i, j. Since the elements of H and K commute and also H (|) K = {e}, by 
Proposition 4.5.17, | a'b/ |= [| a! |, | b/ |] (the least common multiple of the order 
of a‘ and the order of b/). Now, by Proposition 4.5.19, | a’ |= ean) and | b/ |= ane 
Thus, G is cyclic if and only if there exist i, j suchthat| a'b/ |= Leora) =I 
But, the above equality holds if and only if (7,i) = 1 = (s, j) and(r,s) = I. 
(Note that ). Hence, the above conclusion. tt 


r Ss < a Ss 
Fea : cw! — i) (6.7) 
Definition A group G is said to be indecomposable if it cannot be written as a direct 
product of two nontrivial proper subgroups. 


Example 5.2.6 Since any two nontrivial subgroups of an infinite cyclic group inter- 
sect nontrivially, an infinite cyclic group cannot be a direct product of two nontrivial 
proper subgroups. Thus, an infinite cyclic group is always indecomposable. 


Example 5.2.7 A finite cyclic group is indecomposable if and only if it is of prime 
power order. For, if G is a cyclic group of prime power order, then no two nontrivial 
proper subgroups of G are of co-prime orders and so from Proposition 5.2.5, G is 
indecomposable. Next, suppose that G = < a > is not of prime power order. Sup- 
pose that| G |= m.Then,m = mymz2,wherel < mj, 1 < mzand(m,,m2) = 1. 
Take H =<a™' >andK =< a" >.Then,| H |= mzand| K |= m, areofco- 
prime orders, and so by the Lagrange theorem H () K = {e}.But,thenG = HK. 
Clearly, the elements of H and K commute. Suppose that hjk; = h2k2. Then, 
hy'hy = ky'ki © H(\K = {e}. Hence, hy = hz andk; = ky. This shows that 
every element g of G has a unique representation as g = hk. This means that G is 
a direct product of H and K, and so it cannot be indecomposable. 


Example 5.2.8 The additive group Q of rational numbers is indecomposable. For, 
let H and K be nontrivial subgroups of Q. Letr = 7 #0 be a member of H and 
s= & # 0 be a member of K. Then, mk € H (| K and so H() K 4 {0}. Hence, 
Q cannot be a direct product of H and K. 


Based on the following analogy, the language of group theory can be developed 
as the language of set theory was developed. We shall illustrate it very briefly, and 
the reader may complete the details as exercises. 
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Sets <—> Groups. 

Subsets <—> Subgroups. 

Cartesian products <—> Direct products. 

Relation R on a set X <—> Relation R on a group G which is a subgroup of 
GxG. 

Equivalence relation on a set X <—> Equivalence relation on a group G which 
is also a subgroup of G x G. 

Map from a set X to a set Y <—> Homomorphisms from a group G to a group 
G'. 

Note that a map f from a group G to G’ is a homomorphism if and only if f 
considered as a subset of G x G' is a subgroup of G x G’. 


Definition 5.2.9 Let G be a group. An equivalence relation R on G is called a 
congruence on G if it is a subgroup of G x G. 


Now, we discuss the relationship between the congruences on G and subgroups 
of G. 


Proposition 5.2.10 Let R be a congruence ona group G. Then, 

(i) Re = {ae G| (a,e) € R} is a subgroup of G. 

(ii)aRe = Ra = Ra forallaeG. 

(iii) R is uniquely determined by the subgroup R-. In fact, (a, b) € R ifand only 
ifa~'b € R.(ifand only ifab™' € R.). 


Proof (i) R, being congruence, is an equivalence relation on G. Hence (e, e) € R, 
and so e € R,. Thus, R. 4. Let a, b € R,. Then (a, e), (b, e) € R. Since R is a 
subgroup of G x G, (a, e)(b,e)~! = (ab7!, e) € R. This means that ab~! € R.. 
Thus, R, is a subgroup of G. 

(ii) Let x € Rg. Then, (a, x) € R. Since R is an equivalence relation, 
(a-',a~') E R. Since R is a congruence, it is a subgroup of G x G, and hence 
(a-',a~!)- (a,x) = (e,a7'x) € R. This shows that a~!x € R,, or equivalently 
x €aR,. Conversely, if h € R,, then (ah,a) = (a,a)- (h,e) belongs to R, for R 
is a congruence. This shows that ah € R,. 

(iii) Since R is a congruence, to say that (a,b) € R is equivalent to say that 
(e,a~'b) = (a~!,, a7!) - (a, b) belongs to R. Thus, (a, b) € R ifand only ifa~'b € 
R,. Similarly, one can show that (a,b) € R if and only ifab~! € R.. tt 


Remark 5.2.11 The above proposition says that R’ = R” is a necessary condition 
for a subgroup H to be R, for some congruence R on G. The following proposition 
says that the condition is also sufficient. 


Proposition 5.2.12 Let H be a subgroup of a group G. Then, R' is a congruence if 
and only if R! = R’. Further, then H = Ri, 


Proof Suppose that R! is a congruence. To say that (a, b) € R! is equivalent to 
say that (a~',b-'!) = (a,b)~! € R’. This is equivalent to say that ab“! = 
(a~!)-'b-! € H. Hence, R' = R’. Thus, if R! is a congruence, then R! = R’. 
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Conversely, suppose that R’ = R”. Then, we have to show that R’ is a congruence. 
Itis already an equivalence relation. Thus, it is sufficient to show that it is a subgroup. 
Since R' = R’ and a7!ah € H for all h € H, it follows that aha7! € H for all 
a é€Gandh € H. Now, let (a, b), (c,d) € R'. Then, a~'b and c7!d belong to H. 
Hence, (ac)"'!bd = c7!a~'bd = c~'dd7'(a~'b)d belongs to H. This shows that 
(ac, bd) € R'. Already we have seen that the inverses of the elements of R! are the 
elements of R’ = R’. It follows that R! is a congruence. Finally, it is easily observed 
that R! = H. tt 


For further analogy between set theory and group theory, consider a group G and 
congruence R on G. Consider the quotient set 


G/R = {R,|a€G} = {aR, |a€ G} = G/'R. = G/'R. 


If (a,c), (b, d) € R, then (ab, cd) € R, for R is a subgroup. Thus, R, = R, and 
R, = Raq implies that Ra, = R-a. This shows that there is a unique binary operation 
xonG/R = G/'R, = G/"R, given by 


Ra * Rp = Rab 


It is easily seen that (G/R, x) is a group. R-; is the identity and the inverse of R, is 
R,-. This group is called the quotient group of G modulo R. 


Normal Subgroups and Quotient Groups 


Definition 5.2.13 A subgroup H of a group G is called a normal subgroup 
or invariant subgroup ifaH = Ha forallaeG. 


Before having some examples, let us have some necessary and sufficient condi- 
tions for a subgroup to be a normal subgroup. 


Theorem 5.2.14 Let H be a subgroup of a group G. Then, the following conditions 
are equivalent: 

1. H is anormal subgroup of G. 

2 = i, 

3.G/H = G/'H. 

4. R' (R')isa congruence on G. 

5. The correspondence which associates aH with Ha is a map from G/'H to 
G/'H. 

6. (a, b) € R! if and only if (ag, bg) € R' forall g € G. 

7.aha"' € H foralla€ Gandhe H. 

8 aHa! = H forallaeG. 

9. The binary relation* on G/'H givenbyaH *bH = abH isabinary operation 
on G/'H. 


Proof 1 <=> 2. We know that R! = aH and R’ = Ha. Thus, to say that H is 
normal is to say that Ri. = R' foralla € G. This is equivalent to say that R' = R’, 
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2 <=> 3. This follows if we observe that two equivalence relations on a set X are 
same if and only if the corresponding quotient sets are same. 

3 <=> 4. This is precisely Proposition 5.2.12. 

1 <> 5. Clearly, 1 implies that aH ~ Ha is the identity map. Assume 5. Then, 
aH = bH implies that Ha = Hb. Equivalently, a~'b € H implies that ab“! € 
H. This means that R! C R". Since the inverses of the elements of R! in G x G are 
precisely the elements of R’, and those of elements of R’ are precisely the elements 
of R', it follows that R’ C R!. Thus, R'’ = R". This completes the proof of the fact 
that 1 <=> 5. 

4 => 6. Assume 4. Since R! is a congruence, it is a subgroup, and (g, g) € R! 
for all g € G. Hence, (a, b) € R! if and only if (a, b) « (g, g) € R'. 

6 => 2. Assume 6. Then, (a, b) € R! if and only if (ab~!, bb~') € R'. In other 
words, a~'b € H if and only ifab~| € H. This shows that R! = R’. 

1 = > 7. Assume 1. Then,aH = Ha foralla € G.Leta € Gandh € H. Then, 
ah €aH = Ha.Hence,ah = ka forsomek € H. But, thenaha~! = k € H. 

7 => 1. Assume 7. Let ah € aH, whereh € H. Then, aha! € H andsoah = 
aha—'a € Ha. Thus, aH C Ha. Further, ha = a(a~'ha) € aH for alla € G and 
h € H. This shows thataH = Ha. 

7 => 8. Assume 7. Then, aha~! and a~'h(a~')~! belong to H for alla € Gand 
h € H. This shows thataHa~! = H. 

8 => 7 is obvious. 

Finally, we prove that 7 <> 9. Assume 7. Then, ghg™! € H for all g € G and 
h € H. Suppose thataH = bH andcH = dH.Then,a'be Handc ld e H. 
But, then (ac)~!bd = c7!a~'!bd = c7'(a7'b)c(c7'd) belongs to H. This shows 
thatacH = bdH. Thus, x defined by 


aHxcH = acH 


is a binary operation, and so 7 implies 9. Assume 9. Let a € G andh € H. Then, 
hH = eH. Since x is a binary operation 


haH = hHxaH = eHxaH = aH. 


This shows that a~'ha € H forall a € G and for all h € H. Thus, 9 => 7. tt 


Remark 5.2.15 We have shown the existence of a natural bijective map ‘t from the 
set C(G) of congruences on G to the set N S(G) of all normal subgroups of G defined 


Notation. We use the notation H <I G to say that H is a normal subgroup of G. 
Let G be a group. Then, G and {e} are always normal subgroups of G (verify). 
The normal subgroup G of G is called the improper normal subgroup and {e} is 
called the trivial normal subgroup of G. Other normal subgroups are called proper 
normal subgroups. A group G is called Simple if it has no proper normal subgroups. 
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Remark 5.2.16 A simple group is not really simple to study. 


Notation. The notation H <j G is used to say that H is a proper normal subgroup 
of G. Thus, 


H <1G ifand only if(H IGand HAG, H F {e}) 


If H is a subgroup of a group G which is contained in the center of G, then it 
is normal in G, foraH = {ah|heH} = {ha|heH} = Ha. In particular, 
every subgroup of an abelian group is normal. We may ask the following converse 
of the above statement: Let G be a group all of whose subgroups are normal. Can 
we infer that G is abelian? The answer is no. Consider the Quaternion group Qs. {1} 
and Qg are clearly normal. The subgroup {1, —1} is the center, and so it is normal. 
All other subgroups are of order 4, and so of index 2. They are all normal because 
of the following proposition: 


Proposition 5.2.17 Every subgroup of index 2 is normal. 


Proof Let G be a group and Ha subgroup of index 2. We have to show thataH = 
Ha forallaeG.IfaeH,thenaH = H = Ha. Suppose that a ¢ H. Then, 
aH #H # Ha. Since H is of index 2,G/'H = {H,aH}andG/"H = {H, Ha}. 
Since Left(right) cosets form a partition of G,aH = G—H = Ha.Thus,aH = 
Ha for all a € G, and so H is normal in G. tt 


Remark 5.2.18 We observed that Q¢ is a nonabelian group all of whose subgroups 
are normal in G. Baer (Situation der untergruppen und struktur der group, S.B. 
Heidelberg Akad. Mat. Nat, Klasse 2, 1933, 12-17) in 1933 proved that a nonabelian 
group has all its subgroups normal if and only if it is a direct product of Qs, several 
copies of Z2, and an abelian group all of whose elements are of odd order. In particular, 
all subgroups of Qg x Zz are normal. 


Remark 5.2.19 Every subgroup of index 2 is normal. The result is not true for any 
other prime. However, the result can be generalized as follows: If p is the smallest 
prime dividing the order of G, then every subgroup of index p is normal. The proof 
of this fact will be given later. 


Proposition 5.2.20 An abelian group is simple if and only if it is prime cyclic. 


Proof Since every subgroup of an abelian group is normal, an abelian group is simple 
if and only if it has no nontrivial proper subgroups. But, then, it is necessarily prime 
cyclic (Theorem 5.1.19). tt 


Proposition 5.2.21 Let f be a homomorphism from a group G, to a group G2. Let 
Hy <1 Gp. Then, f~'(Hy) <1 Gy. Suppose further that f is surjective and H, <1 G,. 
Then, f (M1) < Go. 
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Proof We already know that f~!(H2) is a subgroup (Proposition 4.4.25). Let 
h € f-'(Hb) andg € G;.Then f(ghg') = f(g) f(h) f(g) !.Sinceh € f~'(Ab), 
f(h) € Hp, and again since Hy < Gy, f(g) f(h) f(g)~! € Ho. Thus, f(ghg7') € 
H. But, then ghg~! € f~!(H). Since g and h are arbitrary, f~'(H2) < G). 

Next, suppose that f is surjective and H; < G;.Let f(h) € f(A), whereh € Hy. 
Let x € Gp. Since f is surjective, there exists g € G, such that f(g) = x. But, 


then xf (h)x7! — fofaf@g! = f(ghg"') (for f is a homomorphism). 
Since H, < Gi, ghg~' € Hy, and so xf (h)x~! € f()). This shows that f(H,) is 
a normal subgroup of Go. tt 


Corollary 5.2.22, Under the map ¢ defined in Theorem 4.4.29 (Correspondence The- 
orem), normal subgroups correspond. tt 


Corollary 5.2.23 Kernel of a homomorphism is a normal subgroup. tt 


Proof Let f be a homomorphism from a group G, to a group G2. Since {e} is a 
normal subgroup of Go, it follows that the ker f = f~'(f{e2}) is anormal subgroup 
of G ie t 


Proposition 5.2.24 Let H and K be normal subgroups of a group G. Then, the 
following holds. 

(i) HK is anormal subgroup. 

(ii) If H(\K = {e}, thenhk = kh forallh € Handke K. 


Proof (i) Since K isnormal HK = UneyhK = Uney Kh = KH. This shows 
that HK is a subgroup of G. Further, let hk ¢ HK, where h € H andk € K. Since 
H and K are normal in G, ghg™! € H and also gkg™! € K. Hence, ghkg"! = 
ghg~'gkg7' also belongs to H K. This shows that HK is normal in G. 

(ii) Leth € Handk € K.Since H isnormalinG,khk~! € H,andsohkh7'k7! © 
H. Similarly, hkh~'k~! € K. Since H() K = {e}, it follows that hkh—'k-! = e. 
This means that hk = kh. 


= 


Corollary 5.2.25 Let G be a group. Then, G is the internal direct product of its 
subgroups H,, Hy, ..., H, if and only if the following hold. 

(i)G = H,H--- Ay. 

(ii) Hy. is normal for each k. 

(iii) A (\(A Ap otk Ay Ag ra HA,) = {e} for all k. 


Proof Since Hy (\(Ai Ao... Ae—1 Aki... An) = {e} forallk, H, 1) Hy = {e} for 
all k #1. From the above proposition, for k 4 /, each element of H; commutes with 
each element of H;. The result follows from Proposition 5.2.4. tt 


Theorem 5.2.26 Let H < G. Then, we have a binary operation x on G/H defined 


by 
aHxbH = abH 


with respect to which G/H is a group. 
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Proof By Theorem5.2.14(9), x is a binary operation on G/H. The verification of 
the associativity of « is straightforward. The identity is eH = H, and the inverse 
of aH isa~'H. tt 


Definition 5.2.27 The group (G/H, «) described in the above theorem is called the 
quotient group or the factor group of G modulo H. 


Check that Z/mZ is the group Z,, of residue classes modulo m. 


Definition 5.2.28 Let H < G. The map v from G to G/H defined by v(a) = 
is called the quotient map from G to G/H. 


It follows from the definition of the binary operation in G/H that the map v is 
a surjective homomorphism. Further, the kernel of v is {a € G| v(a) = H} (note 
that the identity of G/H is H)={a € G| aH = H}.NowaH = H ifand only 
if a € H. This shows that kernel of v is H. This also says that normal subgroups are 
precisely kernels of homomorphisms. 


Proposition 5.2.29 Let H <1 G. Then any subgroup of G/H is of the form K/H, 
where K is a subgroup of G containing H. Further K,i/H = K»2/H if and only if 
K, = Ko. The subgroup K/H is normal in G/H if and only if K is normal in G. 


Proof The quotient map v is a surjective homomorphism from G to G/H whose 
kernel is H. Thus, by correspondence theorem, v induces a bijective map from the 
set S(G) of subgroups of G containing H (the kernel of 1) to the set of all subgroups 
of G/H given by K ~ K/#H under which normal subgroups correspond. The result 
follows. t 


Example 5.2.30 Consider the quotient map v from Z to Z/mZ = Z,,. It follows 
from the above proposition that every subgroup of Z,, is of the form rZ/mZ, 
where mZ CrZ. Now mZ C rZ if and only if r divides m. Clearly, rZ/mZ = 
{0, 2r,...,(¢ — lr}, where gr = m 


Theorem 5.2.31 (Cauchy Theorem for Abelian groups) Let G be a finite abelian 
group. Let p be a prime dividing the order of G. Then, G has an element (and so 
also a subgroup) of order p. 


Proof The proof is by induction on | G |. If| G | = 1, the statement is vacuously 
true. Assume that the result is true for all those groups whose order is less than the 
order of G. We have to prove the result for G. If p does not divide the order of G, then 
the result is vacuously true. Suppose that p divides the order of G. Leta € G, a #e 
and order of a is m. If p divides m, then order of a? is p and we are done. Suppose 


that p does not divide m. Then, (p,m) = 1. Since G is abelian < a > is normal in 
G. Further, 
(i)| G/<a>|= <|G|. 


= 


(ii) p divides | G/ < a >|, for p divides | G | and(p,|a|) = 1. 
By the induction hypothesis, there is an element b < a > of G/ < a > of order 
p. Thus, 
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b<a>#<a>and(b<a>)? =<a>. 


Equivalently, b ¢< a > andb? e<a>.Ifb? = e,bisanelement of order p, and 
we are done. Suppose that b? = a’ #e. Then 


(bP) = (a’)” =— qh =e. 
Hence, (b”)? = e. Suppose that b” = e. Then 
(b<a>)” = b” <a>=<a> 


the identity of G/ < a >. Since b < a > is of order p, p divides m, a contradiction 
to the supposition that (p,m) = 1. Thus, b” is a nonidentity element which is of 
order p. ft 


Theorem 5.2.32 Let G be a group such that G/Z(G) is cyclic. Then, G is abelian 
and G/Z(G) is the trivial group. 


Proof Suppose that G/Z(G) is cyclic and is generated by aZ(G). Then, G/Z(G) = 
{(aZ(G))" |n € Z} = {a"Z(G) | n € Z}. Since G is the union of its cosets, 


G= Uv’ 2@ 


Now, let x, y € G. Then, x = a”u and y = av for some m,n € Zandu,veé 
Z(G). Since u and v are in the center of G, 


This shows that G is abelian, and so G/Z(G) is the trivial group. tt 


Definition 5.2.33 Let G be a group. An element of the form aba~'b~! is denoted 
by (a,b) and is called a commutator. The subgroup G’ of G generated by all 
commutators of G is called the commutator subgroup or the derived subgroup 
of G. 


Theorem 5.2.34 The commutator subgroup G’ of G is a normal subgroup of G 
such that G/G' is abelian. Further, let H be a normal subgroup of G. Then, G/H 
is abelian if and only if G’ © H. Also if H is any subgroup of G containing G’, then 
it is normal in G. 


Proof Leth € G’ and g € G. Then, ghg~'h~! being a commutator, belongs to G’. 
But, then ghg-! = ghg~'h~'h belongs to G’. Hence, G’ is a normal subgroup of 
G. The above argument also shows that if H is any subgroup containing G’, then 
itis normal in G. Now, abH = aHxbH = bHxaH = bdaH foralla,beG 
if and only if (ba)~'ab = a~'b~‘ab belongs to H for all a, b € G. It follows that 
G/H is abelian if and only if all commutators of G are in H. Since commutators 
generate G’, G/H is abelian if and only if G’ C H. tt 
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Remark 5.2.35 It follows from the above theorem that G’ is the smallest normal 
subgroup of G such that G/G’ is an abelian group. The quotient group G/G’ is 
the largest quotient group of G which is abelian. The group G/G’ is called the 
abelianizer of G and is denoted by Gap. 


Group with operator 


Definition 5.2.36 Let Q be a set and G a group. We say that Q operates on G 
through a map « from G x Q to G if 


abxw = (axw)(b*w) 


for allw € Q anda, b € G, where a x w denotes the image of (a, w) under «. We 
also say that G is a Q group through the operation *. Sometimes we also say that the 
pair (G, Q) is a group with operator. 


Let (G, (2) be a group with operator. Then for each w € Q, we have a map ff, 
from G to G given by f.,(g) = g*w. Since 


f.(ab) = abew = (axw)(bew) = fila fu) 


for alla, b € G, it follows that the map f,, is an endomorphism of G. Thus, we have 
amap f from Q to the set End(G) of endomorphisms of G defined by f(w) = fu. 
Conversely, given a map f from © to the set End(G), & operates on G through « 
givenbyaxw = f(w)(a).Let G; and G2 betwo Q groups with the operations *; and 
*2, respectively. A homomorphism ¢ from G, to G2 is called a 2— homomorphism 
if 

daw) = (a) 2 wW 


for alla € G, andw € Q. Equivalently, a homomorphism @ from G, to G2 is a Q— 
homomorphism if dof, = f,0¢forallw € Q. Clearly, the composite of any two Q— 
homomorphisms is a &2— homomorphism. As usual an injective &— homomorphism 
is called a 2— monomorphism and a surjective 2— homomorphism is called a Q— 
epi morphism. A bijective — homomorphism is called a Q— isomorphism. A 
subgroup H of a Q— group G is called a Q— subgroup of G ifhx*xw e€ H for all 
h € AH andw ¢€ Q. In turn, a Q— subgroup H of G is a Q— group at its own right 
such that the inclusion map is a 2— monomorphism. A normal Q— subgroup H of 
G is called a &2— normal subgroup. If H is a Q— normal subgroup of a Q— group 
G, then the quotient group G/H also becomes a Q2— group with respect to the Q— 
operation * given by aH *w = (ax*w)H such that the quotient map v from G to 
G/H given by v(a) = aF is a Q— homomorphism. This group is called the Q— 
quotient group. All the earlier relevant results hold good if we replace groups by Q— 
groups, subgroups by Q— subgroups, homomorphisms by Q— homomorphisms, 
normal subgroups by Q— normal subgroups and quotient groups by &2— quotient 
groups. The verification to this effect is left as exercise. 
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Let G be a group and Q C End(G). Then, G is a Q— group in a natural way. A 
EndG subgroup of G is called a fully invariant subgroup of G. More precisely, a 
subgroup H of G is called a fully invariant subgroup of G if 7(h) € H for all 7 € 
End(G) andh € H. An Aut(G) subgroup of G is called a characteristic subgroup 
of G. Thus, a subgroup H of G is a characteristic subgroup if 7(h) € H for all 
1 € Aut(G) and h € H. A subgroup A is a Inn(G) subgroup if f,(h) ¢ A for 
all h ¢ H and g € G (f, being the inner automorphism determined by g). Thus, 
Inn(G) subgroups are precisely normal subgroups of G. It is evident that a fully 
invariant subgroup is a characteristic subgroup, and a characteristic subgroup is a 
normal subgroup. However, reverse implication is not true. 


Example 5.2.37 The subgroup {e, a} of the Klein’s four group Vs = {e, a,b, c} is 
anormal subgroup. However, it is not a characteristic subgroup, for it is not invariant 
under the automorphism @ of V4 which maps e to e, a to b, b to c, and c to a. 
Indeed, V4 is characteristically simple in the sense that it has no nontrivial proper 
characteristic subgroup. 


Example 5.2.38 Let Z(G) denote the center of G and 7 an automorphism of G. 
Let a € Z(G) and x € G. Then, x = 7(y) for a unique y € G. Now, n(a)x = 
nany) = nay) = nya) = n(y)n(@) = xn(a). This shows that (a) € Z(G). 
Thus, the center of a group is always a characteristic subgroup of the group. However, 
the center of a group need not be fully invariant. Consider, for example, the group 
Qs x Z4, where Qg denotes the Quaternion group. Clearly, 


Z(Qs x Za) = {1,-1} x Zy 


Take an isomorphism 7 from Z, to the subgroup {1, i, —1, —i} x {0} of Og x Z4. 
Let pz denote the second projection. Then, Top2 is an endomorphism of Qg x Z4 
which maps the center {1, —1} x Z,4 tothe subgroup {1, 7, —1, —i} x {O} of Og x Z4 
which is not contained in the center. This shows that center of a group need not be 
fully invariant. 


Example 5.2.39 The commutator subgroup G’ = [G, G] of a group is generated by 
the set {(a,b) = a~'b-'ab | a,b € G} of commutators of G. If 7 is an endomor- 
phism of G, then n((a,b)) = (n(a))'(n(b))~! nan) = (na), n()) is again a 
commutator. This shows that the commutator subgroup G’ of G is fully invariant in 
G. More generally, if H and K are fully invariant subgroups of G, then the subgroup 
[H, K] =<{h-'k'hk |h eH, k € K} > is also fully invariant. 


Example 5.2.40 All subgroups of a cyclic group are fully invariant (verify). Are 
there more groups all of whose subgroups are fully invariant? 


Let Q C End(G). Recall that an endomorphism 7 of G is a Q— endomorphism 
if 7 commutes with all members of Q. A Jnn(G)— endomorphism of G is called a 
normal endomorphism. Thus, an endomorphism 77 of G is a normal endomorphism 
if nof, = f,on for all g € G. More precisely, an endomorphism 7) of G is a normal 
endomorphism if and only if 
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nigxg') = gn(x)g™' 


for allg,x EG 


Proposition 5.2.41 An endomorphism a of G is anormal endomorphism if and only 
ifg-'o(g) € Cg(a(G)) forall g € G. Indeed, then the map n from G to Cg(a(G)) 
defined by n(g) = o(g~')g is a homomorphism. 


Proof Let o be an endomorphism of G. Suppose that o is a normal endomorphism. 
Then by the definition, 


a(gxg"') = ga(x)g 'forallg, x € G. 


In turn, 
a(g)o(x)(o(g))"| = go(x)g ‘forall g,x € G. 


Hence, 
g 'o(g)a(x) = a(x) g 'o(g)for allg,x EG. 


This shows that g~!o(g) € Cg(a(G)) for all g € G. Finally, consider the map 7 
from G to Cg(a(G)) defined by n(g) = (a(g))~'g. Then 

7(9192) 

= (o(9192)) | 9192 

= (0(92)) (oq) 9192 

= (o(91))'91(0 (92) 'g2 (for (o(g1))~“'g1 € Ce(o(G))) 

= (91) N(92)- 

This shows that 7) is a homomorphism. tt 


Definition 5.2.42 An endomorphism o of G is called a central endomorphism 
if o induces identity automorphism on G/Z(G), or equivalently g~!o(g) € Z(G) 
forall g €G. 


consequently, observe that a central endomorphism maps center to itself. 


Corollary 5.2.43 All central endomorphisms are normal endomorphisms. Also a 
surjective normal endomorphism is central. tt 


Corollary 5.2.44 Let o be an automorphism of G. Thenao € Cauig)(Unn(G)) if 
and only if the map T defined by T(g) = g~'a(g) is a homomorphism from G to 
its center Z(G). Conversely, let tT be a homomorphism from G to its center Z(G). 
Then, the map o defined by o(g) = gt(g) is anormal endomorphism of G. 


Proof To say thata € Caut(g)(Znn(G)) is to say that o is a normal automorphism 
of G. By Proposition 5.2.41, this is equivalent to say that the map 7) defined by 
n(g) = o(g7')g isa homomorphism from G to Cg(a(G)) = Z(G). Since Z(G) 
is abelian, the map 7 defined by r(g) = g~'a(g) = (n(g))~! isa homomorphism 
from G to Z(G). Conversely, let 7 be a homomorphism from G to its center Z(G). 
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Consider the map o from G to G defined by o(g) = gr(g). Then 
o(9192) 

= 9192T(9192) 

= gigoT(gi)T (go) 

= giT(91)92T (92) (for T(g1) € Z(G)) 

= 0(91)0 (92) 

Thus, o is an endomorphism of G. Further, 
o(gxg"') 

= gxg"'r(gxg7') 

= gxg't(g)T(x) (rg)! 

= gxg"'T(x) (for T(x) € Z(G)) 


= gxt(x)g"! 
= go(x)g7!. 
This shows that the map o defined by o(g) = grt(g) is a normal endomorphism 
of G. o 


The following corollary is an immediate consequence of Proposition 5.2.41. 


Corollary 5.2.45 Let G be acenter less group in the sense that Z(G) is trivial. Then, 
the only normal automorphism of G is the identity map. In particular, Z(Aut(G)) 
is also trivial. In turn, for any nonabelian simple group G, Z(Aut(G)) is trivial. { 


Remark 5.2.46 Indeed, for any simple group G, Aut(G) is a complete group in 
the sense that Z(Aut(G)) = {Jc} and every automorphism of Aut(G) is an inner 
automorphism of Aut(G). 


Example 5.2.47 Every endomorphism of an abelian group is normal endomorphism. 
We determine normal endomorphisms of the Quaternion group Qs. The trivial endo- 
morphism and the identity automorphism are obviously normal endomorphisms as 
they commute with all inner automorphisms. Let 7 be another endomorphism of 
Qs. If the image 7(Qg) = {1, —1}, then since Cg, ({1,—1}) = Qs, it follows 
that g-'n(g) € Co,({1, —1}) for all g € Qs. By Proposition 5.2.41, 7 is a normal 
endomorphism. The image of an endomorphism cannot be a subgroup of order 4 
(justify). Suppose that 77 is a nonidentity automorphism. Since Z(Qg) = {1,—l}, 7 
is a normal endomorphism if and only if 7(g) = xg. There are 3 such automor- 
phisms. There are no more automorphisms. This shows that all endomorphisms of 
Qs are normal endomorphisms. 


Example 5.2.48 Trivial subgroup, Qs, and {1, —1} are characteristic subgroups of 
Qs. However, the rest of the subgroups are not characteristic. 


Definition 5.2.49 Let G be a direct product of its subgroups H, Ho,..., H,. Then 
for each k, we have the map p; from G to G defined by px(g) = gx, where g = 
9192 °** Yn is the unique representation g as product of elements of H), Ho,..., An. 
The map p;, thus obtained, is called the ky, Projection. 
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Proposition 5.2.50 Let G be a direct product of its subgroups H\, Hz,..., An. 
Then, we have the following: 

(i) Each px is a normal endomorphism which is idempotent in the sense that 
(px)? = px. The image py(G) = Hk. 

(ii) Each pair pg, pi, k € 1 of distinct projections is summable in the sense that the 
map px + pi defined by (pe + pi)(g) = Pk(9)Pi(g) is again a normal endomor- 
phism (indeed, a projection on H; H)). 

(iii) The composition pop, is the trivial endomorphism for all k # 1. 

(iv) pi + po+---+ Pn = Ig, the identity automorphism on G. 

Conversely, if G is a group which has endomorphisms p, satisfying conditions (1), (ii), 
(iii), and (iv), then G is a direct product of its subgroups p\(G), p2(G),..., Pn(G). 


Proof (i) Let g and h be elements of G with their unique representations g = 
9192°°'gn and h = hyh2---h,, respectively. Then, gh = gihigoh2--- GQnhn. 
Hence, px(gh) = gehe = pr(g)px(h). This shows that p, is an endomorphism 
of G. By the definition px(px(9)) = Pr(Ge) = ge = Px(g). Again, ghg"' = 
gihigy'gohogy' +++ gehkgy' +++ Gnltng,'. Thus, pe(ghg7') = gehege' = ghe 
g°'| = gpxth)g7'. This proves (i). 

(11) Let g and h be elements of G with their unique representations g = 9192--- Gn 


and h = hyh2---h,, respectively. Clearly, then, for k Al, (pe+pi)(gh) = 


Pr(gh)pigh) = gh@eghi = gh = PrQP(QpPrApih) = (pet 
11)(9) (Pe + p1)(h). This proves (ii). 

(iii) An element a € H has unique representation asa = e1@2--- @j-14@j41-+-* en, 
where e; = e the identity of G, and which belongs to H; for all i. Thus, for 


kAl, px(a) = e for all ae H). Again, since p)(G) = A), it follows that 
Px(pi(g)) = e for all g € G. This proves (iii). 

(iv) Let g be an element of G with unique representations g = g192---gn. Then 
by the definition, g, = p;(g) for all i. Thus, g = pi(g)po(g)--- pa(g) = 
(pi + p2 +--+ + pn)(g) for all g € G. This proves (iv). 

Conversely, suppose that G is a group which has endomorphisms p; satisfying 
conditions (i), (ii), (iii), and (iv) of the proposition. Put p,(G) = Hy. Since the 
image of a normal endomorphism is a normal subgroup (verify), H, is a normal 
subgroup of G for each k. This shows that the condition (ii) of Corollary 5.2.25 holds. 
By (iv), every element g € G is expressible as g = (pi +pot+-:-+pn)(g) = 
Pi(g)p2(g)--- Pn(g). This shows thatG = H,H2--- H, and so condition (i) of 
Corollary 5.2.25 also holds. Let a € Hy (\ Hy A2--- Ap: Hei -++ Hy. Then a = 
Ay Ay +++ Ag—1eKAK1 +++ An. Since a € Hy = py(G) and (py)? = px (by (i)), it 
follows thate = eg, = px(a) = a. This proves condition (iii) of Corollary 5.2.25. 
The result follows from Corollary 5.2.25. tt 


Proposition 5.2.51 Let G be a group and 7 an idempotent normal endomorphism 
of G. Then, G is a direct product of n(G) and Ken 7. 


Proof Since 7 is normal endomorphism 7(G) is a normal subgroup. Already, ker 7 
is a normal subgroup. Let g € G. Then, since 7) is idempotent, (7(g))~!g is in the 
ker n. Further, g = 7(g)(n(g))~‘g. This shows thatG = 7(G)ker 17. Suppose that 


170 5 Fundamental Theorems 


g € 9(G) ()ker 7. Then, since 7 is idempotent, g = 7(g) = e. This shows that 
n(G)(\ ker n = {e}. The result follows from Corollary 5.2.25. tt 


Corollary 5.2.52 Let G be an indecomposable group and ) a normal idempotent 
endomorphism. Then, 1 is either trivial or a normal automorphism of G. 


Proof It follows from the above proposition that G is a direct product of 7(G) and 
ker 7. Since G is indecomposable, 7(G) = {e}, orelsen(G) = Gandkern = fe}. 
This shows that 77 is trivial or else an automorphism. tt 


Proposition 5.2.53 Every finite group is a direct product of indecomposable groups. 


Proof The proof is by induction on | G |. If| G | = 1, then there is nothing to prove. 
Assume that every group whose order is less than that of G is a direct product of 
indecomposable groups. We show that G is also a direct product of indecomposable 
groups. If G itself is indecomposable, there is nothing to do. If not, then G is a 
direct product of two nontrivial subgroups H and K of G. Clearly, then| H | <| G | 
and | K |< | G |. By the induction hypothesis, H and K are direct products of 
indecomposable subgroups. Hence, G is also a direct product of indecomposable 
subgroups. tt 


Definition 5.2.54 A subgroup H ofa group G is called a direct factor of G if there 
is a subgroup K of G such that G is a direct product of H and K. 


Thus, all direct factors are normal subgroups. Normal subgroups need not be 
direct factors. Indeed, all subgroups of the additive group Z of integers are normal 
but no nontrivial proper subgroups are direct factors (verify). 

The following theorem known as Krull-Remak—Schmidt Theorem will be proved 
in Chap. 10. 


Theorem 5.2.55 (Krull-—Remak—Schmidt) [fa group G is a direct product of inde- 
composable subgroups H,, Hz, ..., H, and also a direct product of indecomposable 
subgroups K,, K2,..., Ks, then 

(i)r = s, and 

(ii) there is a bijective correspondence between the sets {H\, Hz,..., H-} and 
{K,, Ko,..., K,} such that the corresponding subgroups are isomorphic. tt 


Remark 5.2.56 The above results reduce the problem of classification of finite groups 
to the problem of classification of indecomposable groups. The solution to this prob- 
lem is beyond a dream to mathematicians. 


Exercises 


5.2.1 Show that the direct product of abelian groups is abelian. 


5.2.2 Let G be the internal direct product of groups G;, Go, ..., Gn. Show that 
each G; is normal in G. 
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5.2.3 Let H <1 G and K a subgroup of G containing H. Show that H <i K. 
Remark 5.2.57 We shall see that H < K and K <G need not imply that H IG. 


5.2.4 Let H <1G and K a subgroup of G. Show that HK is a subgroup of G. We 
shall see that this subgroup need not be normal in G. 


5.2.5 Show that the Klein’s four group Vs = {e, a, b, c} is the internal direct product 
of its subgroups H = {e,a} and K = {e,b}. Deduce that it is isomorphic to 
Z2 x 22. 


5.2.6 (i) Show that the group P of roots of unity is indecomposable. 
(ii) Show that Qg is indecomposable. 
(iii) Show that Direct factor of a direct factor is itself a direct factor. 
(iv) Characterize groups all of whose subgroups are direct factors. 


5.2.7 Is the additive group R of reals indecomposable? 


5.2.8 Show that the multiplicative group of positive rational numbers is indecom- 
posable. 


5.2.9 Express V4 as a direct product of indecomposable groups. 


5.2.10 Characterize subgroups H of a group G for which the relation R’ is anormal 
subgroup of G x G. 


5.2.11 Show that all subgroups of Qg x Zp are normal. Find all of its direct factors. 
5.2.12 Show that all subgroups of Qg x Z3 are normal. Find all of its direct factors. 


5.2.13 Show that every abelian group of order pq, where p and q are distinct primes, 
is cyclic. 


5.2.14 Is it true that all subgroups of Qg x Z4 are normal? Support your claim. 


5.2.15 Show that the direct image of a normal subgroup under a homomorphism 
need not be a normal subgroup. 


5.2.16 Show that the set /(G) of inner automorphisms of G is a normal subgroup 
of Aut(G). 


5.2.17 Let H be a subgroup of G such that x? € H Vx € G. Show that H 1G. 


5.2.18 Let G bea finite simple group and f a nontrivial endomorphism of G. Show 
that f is an automorphism of G. 


5.2.19 Let G be a group which has a unique subgroup H of order m > 1. Show 
that HG. 
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5.2.20 Show that the intersection of a family of normal subgroups of a group G is 
normal in G. 


5.2.21 Characterize the elements of the normal subgroup generated by a set S. Show 
that, in general, it is bigger than the subgroup generated by S. 


5.2.22 Let S be a normal set in the sense that ghg™! € S for allg € Gandhe S. 
Show that < S$ > is normal in G. 


5.2.23 Write explicitly the elements of the group Qs/{1, —1}. Show that it is iso- 
morphic to V4. 


5.2.24 Call a normal subgroup H of G to be a maximal normal subgroup, if it 
is a proper normal subgroup which is not properly contained in any proper normal 
subgroup. Show that a normal subgroup H is maximal normal if and only if G/H is 
simple. 


5.2.25 Show that the additive group Q of rationals, the additive group R of reals 
and the circle group do not contain any maximal normal subgroups. 


5.2.26 Determine all maximal normal subgroups of the additive group Z of integers. 


5.2.27 Let G be a finite abelian group and m divides | G |. Show that G has a 
subgroup of order m. 
Hint. Prove it by induction on | G |, using Cauchy theorem. 


5.2.28 Let G be a finite abelian group all of whose elements are of orders a power 
of p, where p is a prime. Show that | G | = p” for some n. 


5.2.29 Let H bea subgroup of G. Let Ng(H) = {g € G| gH = Hg}. Show that 
Ng (#1) is a subgroup of G containing H as a normal subgroup. Show further that 
Ng (11) is the largest subgroup of G in which H is normal. This subgroup is called 
the normalizer of H in G. 


5.2.30 Let H be a subgroup of G. Show that Coreg(H) = Nec gHg"' is the 
largest normal subgroup of G contained in H. This subgroup is called the core of H 
inG. 


5.2.31 Let G be a group and H = <a > acyclic subgroup which is normal in G. 
Show that every subgroup of H is normal in G. 


5.2.32 Let H, K, L be subgroups of G, where L is normal in G. Suppose that 
HL = KLandH()L = K()\L.ShowthatH = K. 


5.2.33 Let H be a subgroup of G. Show that Cg(H) < Ng(H), where Cg(H) is 
the centralizer of H in G. 
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5.2.34 Find the commutator subgroup of the Quaternion group Qs. 
5.2.35 Find the commutator subgroups of D4. 


5.2.36 Find all characteristic subgroups and also all fully invariant subgroups of a 
Dihedral group. 


5.2.37 Let o and 7 be a pair of normal summable idempotent endomorphisms of 
a group G such that o + 7 is an automorphism. Show that one of o and 7 is an 
automorphism. 


5.2.38 Let 7 be a normal endomorphism of G. Show that all normal subgroups of 
7™(G) are also normal subgroups of G. 


5.2.39 Let G be a group and H a subgroup of G. Let S be right transversal to H 
in G (S is obtained by selecting one and only one member from each right coset of 
H in G with the choice e for the coset H). Define a binary operation o on S by 


{xoy} = Hxy a S 


Show that (S, 0) is a right quasigroup in the sense that equations Xoa = b have 
unique solutions for all a, b € S (Conversely, it is proved in [Ramji Lal, “Transver- 
sals in Groups,’ Journal of Algebra 1996] that every right quasigroup with identity 
turns out to be a right transversal to a subgroup H in group G with G, H univer- 
sal in certain sense). Show that if H <G, then right quasigroups determined by 
all right transversals are isomorphic to G/H (Conversely, it is shown (Ramji Lal— 
R.P. Shukla, ‘Perfectly Stable Subgroups of a Finite Group,’ Communications in 
Algebra 1996), using classification of finite simple groups, that if all right transver- 
sals to a subgroup H of a finite group G determine isomorphic right quasigroups, 
then H IG). 


5.3. Fundamental Theorem of Homomorphism 


In the last section, we noticed that a quotient group G/H is homomorphic image of 
G. In this section, we show that every homomorphic image of G is isomorphic to a 
quotient group of G. 


Theorem 5.3.1 (Fundamental Theorem of Homomorphism) Let f be a homomor- 
phism from a group G to a group G'. Let K be the kernel of f. Let H <1 G. Then, 
there exists a homomorphism f from G/H to G' making the diagram 
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GH 


commutative (fov = f) if and only if H © K. Further, then, 
(i) sucha f is unique. 
(ii) f is injective if and only ifH = K. 
(iii) f is surjective if and only if f is so. 


Proof Suppose that such a homomorphism f exists. Let h € H. Since H is the 
identity of G/H and f is ahomomorphism, 


e' = f(H) = f(hH) = fv) = f@) 


Hence, h € K.Thus, H C K. Conversely, suppose that H C K. Suppose further that 
aH = bH.Then,a~'b € H.Since H C K, a"'b€ K.Intum,e’ = f(a~'b) = 
f(a)' f(b), and so f(a) = f(b). This ensures that we have a map f from G/H 
to G’ defined by f(aH) = f(a). Now, 


F(aH *bH) = f(abH) = flab) = f(a) f(b) = Fla) f(bH). 


Thus, f is a homomorphism. By the definition of f, fov = f. 

Next, suppose that such a homomorphism f exists. If g is also a homomorphism 
such that fov = f = gov, then since v is surjective, f = g. This proves (i). 
We know that f is injective if and only if kernel of f is the trivial subgroup {H} of 
G/H. Now, 


ker f = {aH €G/H|e' = f(aH) = f@} = {aH | ae€ K}= K/H 


Clearly, K/H = {H}ifand only if K = H. This proves (ii). _ 
Finally, since v is surjective, fov = f is surjective if and only if f is 
surjective. tt 


Corollary 5.3.2 Let f be a surjective homomorphism from G to G'. Then, G/ker f 
is isomorphic to G'. tt 
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Remark 5.3.3 The last corollary is used very frequently and is also termed as the 
fundamental theorem of homomorphism. 


Applications of the fundamental theorem of homomorphism 
3.1. R/Z is isomorphic to the circle group S!. 


Proof Consider the map f from the additive group R of real numbers to the circle 
group S! defined by f(a) = e?*'*. By the law of exponents, it follows that f is a 


homomorphism which is surjective. Further, e?"“ = 1 if and only if a € Z. Thus, 
the kernel of f is Z. From the fundamental theorem of homomorphism, the result 
follows. tt 


3.2. Q/Z is isomorphic to the group P of roots of unity. 


Proof e?"'“ € P if and only if there exists n € Z such that (e?"*)" = e?7ma — J, 
This is equivalent to say that a € Q. Thus, we have a surjective homomorphism f 
from Q to P given by f(a) = e?* whose kernel is Z. The result follows from the 
fundamental theorem of homomorphism. tt 


3.3. Suppose that n divides m. Then mZ C nZ and nZ/mZ is isomorphic to Z™. 


Proof Clearly,mZ C nZ if and only if every integral multiple of m is also an integral 
multiple of n. This means n divides m. Define a map ¢ from Z to nZ/mZ by 
o(r) = nr +mZ. dis clearly a surjective homomorphism. Now nr +mZ = mZif 
and only ifnr € mZ. This is equivalent to say that r is a multiple of “. Thus, the kernel 
of ¢ is “Z. The result follows from the fundamental theorem of homomorphism. 


3.4. Let G be a group. Then, G/Z(G) is isomorphic to the group 7(G) of inner 
automorphisms of G. 


Proof The map f from G to 1(G) defined by f(g) = fj, where f, is the inner 
automorphism determined by g (f(x) = gxg7') is a surjective map which is easily 
seen to be a homomorphism. Now, 


ker f = {9g €G| fy = Ic} = {9 € Gl gxg!' = xVx EG} = {geEG| 
gx = xgVx € G} = Z(G). By the fundamental theorem of homomorphism the 
result follows. tt 


Theorem 5.3.4 (1st Isomorphism Theorem) Let f be a surjective homomorphism 
from G to G'and H 1G such that ker f C H. Let H' = f(A). Then H’ 1G’, 
and there exists a unique isomorphism f from G/H to G'/H' such that the following 
diagram is commutative (fovg = Vqof ), where vg and vq are the corresponding 
quotient maps). 
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VG Vq 


a! gum 


Proof Consider the map ¢ = vc of from G to G’/H’. Since vg and f are surjective 
homomorphisms, ¢ is a surjective homomorphism. Now, 

kerb = {a€eG|d(a) = H’}. 

= {a€Glvg(f@) = H’}. 
{ae G|f@H' = H’}. 
= {aeG|f@eH'. 

= fH’). 

= f'(f(H)). 

Since H contains the kernel of f, by Proposition4.4.28, f-'(f(H)) = H. 
Hence, the kernel of ¢ is H. The result follows from the fundamental theorem of 
homomorphism. tt 


Corollary 5.3.5 Let H and K be normal subgroups of G such that H © K. Then, 
K/H <G/H and (G/H)/(K/H) is isomorphic to G/K. 


Proof Consider the quotient map v from G to G/H. Then, v is surjective homo- 
morphism, andkerv = H.Alsov(K) = K/H. The result follows from the first 
isomorphism theorem. tt 


Theorem 5.3.6 (Noether 2nd Isomorphism Theorem) Let H and K be subgroups 
of G and K <1G. Then, H (\ K < H and H/H () K is isomorphic to HK/K. 


Proof Since K 1G, HK = KH, and so HK is a subgroup of G. Since K < 
G, K <1 HK. Thus, we can consider the quotient group H K/K.The map ¢ from H 
to HK /K defined by ¢(h) = hK isclearly ahomomorphism. Further, any element 
of HK /K isof the formhkK for someh € H andk € K.Sinceh~'hk = k belongs 
toK,hkK = hK = ¢(h). This shows that ¢ is surjective homomorphism. Now, 


ker 6={heH|¢h)=Ks={he H|hK=K)={he A |he K}. 
Thus, ker 6 = H()K. Since kernel of a homomorphism is a normal subgroup 


H ()K <H. Also by the fundamental theorem of homomorphism, H/H () K is 
isomorphic to HK/K. tt 
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Remark 5.3.7 The assumption that K <I G is essential, for otherwise H K /K or even 
HK need not be groups. 


Exercises 
5.3.1 Show that Qg/{1, —1} is isomorphic to V4. Deduce that /(Qg) is isomorphic 
to V4. 


5.3.2 Let G be a group such that Z(G) = {e}. Show that G is isomorphic to 
I(G). Deduce that any nonabelian simple group is isomorphic to its group of inner 
automorphisms. 


5.3.3 Show that G/Z(G) and /(G) cannot be a nontrivial cyclic group. 
5.3.4 Can | /(G) | = 29? Support. 


5.3.5 Let G =< a > beacyclic group of order m. Define a map f from Z to G 
by f(x) = a”. Show that f is surjective homomorphism with kernel mZ. Deduce 
that G is isomorphic to Z. 


5.3.6 Show that Z,, is isomorphic to (Z/mZ)/(nZ/mZ), where n divides m. 


5.3.7 Show that R/Q is isomorphic to $'/P. 
Hint. Consider the map x ~» e?” ‘*, and use the first isomorphism theorem. 


5.3.8 Show that mZ/mZ ()\nZ is isomorphic to mZ + nZ/nZ. Deduce that Zim. 
is isomorphic to Za, Deduce further that [m,n](m,n) = mn. 


5.3.9 Show that Q + 27Z/27rZ is isomorphic to Q. Deduce that S' contains an 
isomorphic copy of Q. 
Hint. Observe that Q(]27Z = {0}. Use the Noether isomorphism theorem. 


5.3.10 Show that Q + eZ/eZ is isomorphic to Q. 
5.3.11 Show that 27Z + Q/Q is isomorphic to Z. 


5.3.12 Let H, K be subgroups of G and L < GsuchthatHL = KL, H(\L = 
{e} = K ()L. Show that H is isomorphic to K. 


5.3.13 Show by means of an example that H; ~ H> and G,/H, ~ G2/ Hp need not 
imply that G; ~ Go. 


5.3.14 Show that Ng(H)/C¢(A) is isomorphic to a subgroup of Aut (H). 
Hint. Consider the map g ~+ f,, where f,(h) = ghg™!. 


5.3.15 Use the fundamental theorem of maps in set theory to prove the fundamental 
theorem of homomorphism and Isomorphism theorems. 


Chapter 6 
Permutation Groups and Classical Groups 


The two main sources of groups are the permutation groups and the matrix groups. 
This chapter is devoted to introduce these groups, and to study some of their funda- 
mental and elementary properties. In Algebra 2 and in Algebra 3, we shall study the 
representations of abstract groups as permutation groups, and also the representations 
of abstract groups as matrix groups. 


6.1 Permutation Groups 


In this section, we shall introduce the permutation groups (also called the symmetric 
groups) and representations of abstract groups in these groups. 

Let X bea set. Let Sym(X) denote the set of all bijective maps from X to X. Then, 
Sym(X) is a group with respect to the composition of maps. This group is called 
the Symmetric group or the Transformation group or the Permutation group 
on X. The proof of the following proposition is an easy verification. 


Proposition 6.1.1 If f is abijective map from X to Y, then it induces an isomorphism 
Sym(f) from Sym(X) to Sym(Y) defined by 


Sym(f)(g) = fogof'. tt 


Thus, the group Sym(X) depends only on the cardinality of X. In particular if X 
is finite, then Sym(X) depends (up to isomorphism) on the number of elements in 
X. In this section, we shall be interested in symmetric groups on finite sets only. If 
X contains n elements, then without any loss we can take X = {1,2,...,n}. 

The symmetric group Sym(X), where X = {1,2,...,m} will be denoted by S, 
and will be called the symmetric group or the permutation group of degree n. 
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Thus, S,, is the group of all bijective maps from {1, 2, ... , n} to itself, the operation 
being the composition of maps. We may represent an element f € S, (without any 
ambiguity) by 

( 1 2. cx Th ) 
fA) FQ)... fn) 


Since f is a bijective map, the second row is just the rearrangement (permuta- 
tion) of 1, 2,...,. Thus, any f € S, gives a unique permutation described above. 
Conversely, if we have a rearrangement of 1, 2,..., , then it gives rise to a unique 
bijective map from {1, 2, ..., m} to itself by putting the rearrangement below 12... 
as above. For example, ifn = 4, the rearrangement 2314 of 1234 gives rise to a 
bijective map a from {1, 2, 3, 4} to itself given by a(1) = 2, a(2) = 3, a(3) = 1 
and a(4) = 4. In the above notation 


Thus, the members of S,, can be viewed as permutations. The product gf of 


permutations 
¥e ( 1 2 soe i ) 
PU) Fee TO) 
and 
_ 1 2 san HK 
#= (gin am 20) 
is given by 


f= ( 1 2 sates n ) 
8! = \ e(F(1)) g(f(2)) -.. g(f(n)) 


Example 6.1.2 If 
and 
then 


( 
: ) 
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Thus, a8 4 Ba 
Example 6.1.3 Taken = 3, 


and ‘s 
Pi) 
Then are 
ab = (331) 
and 


Again a3 4 Ga. This shows that S3 is nonabelian. 


Let n < m. Every permutation in S, can be viewed as a permutation in S,, by 
fixing alli > n. This defines an embedding of S,, in to S,,. In other words S, can 
be thought of as a subgroup of S,,. It follows that S,, is abelian if and only ifn < 2. 

Since the number of permutations on n symbols isnp, = n!, we have 


Theorem 6.1.4 | S, |= n!. tt 
Corollary 6.1.5 p” = I forall p € S,, where I is the identity permutation. 
Proof Follows from the Corollary 5.1.17. tt 


Cycles and Transpositions 


Now, we consider special types of permutations, for example, the permutation 
_ (123456 
OS 12539164 
a takes | to 2,2 to 5, 5 to 6, 6 to 4 and 4 to 1. The remaining symbol 3 is fixed. We can 
faithfully represent the permutation a by the row (1 2 5 6 4) with the understanding 


that each symbol goes to the following symbol, the last symbol is mapped to the first 
symbol, and the symbol not appearing in the row is kept fixed. Thus, the permutation 


1234567 
1524763 


can be represented by (2 5 7 3) whereas 
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1234 
2143 


Definition 6.1.6 A permutation a € S,, is called a cycle of length r > 1 if there 
exists a subset {i;,i2,...,i-} of {1,2,...,m} containing r distinct elements such 
that a(i,) = i2, a(i2) = 13,...,a(i,-1) = i, a(i,) = i; anda(j) = j forall 
J € {i, i2,...,i-}. The cycle ais denoted by (iji2 ...i,). A cycle of length 2 is called 
a transposition. Thus, a transposition is represented by (i j) which interchanges i 
and j and keeps the rest of the symbols fixed. 


cannot be represented in this form. 


Thus, the identity permutation is not treated as a cycle. 
Cycles (iji2...i,) and (jj jo... j;) are said to be disjoint if 


fi, in... i (Vis dod} = B 
Proposition 6.1.7 Any two disjoint cycles commute. 


Proof Let a = (iiz...i-) and @ = (ji jo... Js) be disjoint cycles. If 1 ¢ 
{ij,12,---, lr, Ji, Ja,---, Js}, thena(@) = Bl) = landsoa(G(l)) = B(a()). If 
l € {ij, i2,...,i,}, then ( fixes / and also a(/). This means that a(G(/)) = a(l) = 
B(a()). Similarly, if 7 € (71, jo,..-, js}, then also (aZ)\7) = BO) = (Ba)(). 
This shows that Ga = af. ft 


Proposition 6.1.8 [fa and @ are disjoint cycles, then < a>{)<B>= {I}. 


Proof If a and are disjoint cycles, then the symbols changed by any power of a is 
fixed by any power of 3 and the symbols changed by any power of (3 is fixed by any 
power of a. Thus, a! = 8” ifand only ifa' = I = B". tt 


Remark 6.1.9 Tf p and q are permutations such that the symbols changed by p 
are fixed by q and the symbols changed by g are fixed by p, then pg = gp and 
<p> {)<4q>= {IJ}, and so in this case o(pq) = [o(p), 0(q)]. 


Proposition 6.1.10 Leta € S, be acycle of length r. Then, o(a) = r. 


Proof Leta = (iji2...i,). One observes inductively that a'(i;) = i,4, fort < 
r — 1.Hence,a! ¢ I fort < r — 1.Next,a’(i,) = a(a”'(i,) = ali,) = i). Also 
al (i,) = aX(al"()) = ala’ (al!) = al (a!) = a l(aGi,)) = 


al-!(i,) = i,. Hence, a” = I. This shows that o(a) = r. t 
Proposition 6.1.11 Let {a,,a2,...,a,} be a set of pairwise disjoint cycles of 
lengths m,,m2,...m, respectively. Leta = a,Q2...a;. Then, o(a) = [m,,m2, 


...,M,] the least common multiple of m,, m2, ...m,y. 
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Proof Letm = [m,,mz2,...m,]. By the above proposition, o(a;) = m; foralli € 
{1,2,...,r}.Since a), a2,..., a, are pairwise disjoint, they also commute pairwise 
andsoa” = al'as'...aj” = I.Suppose further thata’ = ajas...a), = I.Since 
the symbols changed by ai is kept fixed by a' j # i, it follows thata; = J foralli. 
Since o(a;) = mj, it follows that each m; divides t. Hence, m divides t. This shows 


that o(a) = m. t 
Example 6.1.12 The order of the permutation (1 3 4)(2 5) is [3, 2] = 6. 


Theorem 6.1.13 Every nonidentity permutation can be written as a product of dis- 
joint cycles. Further, any two representations of a nonidentity permutation as product 
of disjoint cycles is same up to rearrangement of cycles. 


Proof Let a be a nonidentity permutation in S,. Then, there exists i, such that 
a(i;) #i,. Since o(S,) = n}, a” = TJ. Hence, a (i;) = i. Let 1; be the 
least positive integer such that a!'(i;) = i;. Given m € Z, by the division algo- 
rithm, there exist q,r such thatm = l,q + r, where O <r </, — 1. But, then 
a" (i;) = a’ (i;). Itis clear from the above observation that the effect of the permu- 
tation a on symbols in {i,, a(z,), a?(i;),..., a'~!(i,)} is same as that of the cycle 
Cy = (i; a(i,) a7 (in)... a“! i). Ifa = Cy, there is nothing to do. If not, there 
exists in ¢ {i,, a(i1), a?(i,),...a'—!(i,)} such that a(ir) # iz. As before consider 
the cycle C2 = (i2 a(i2) a? (iz)... a/2~! (iz), where Jy is the smallest positive inte- 
ger such that a(i2) = in. Clearly, C; and C3 are disjoint cycles. If a = C,Co, 
there is nothing to do. If not proceed. This process stops after finitely many steps 
giving a as product of disjoint cycles, because the symbols are finitely many. 
Finally, we prove the uniqueness. Suppose that a # I and 


63 CCC) = OCC, 


where C; and C; are disjoint for i A j, and also C; and C; are disjoint for k # 
1. Suppose that a(p) # p. Then, there exist i,k such that C;(p) 4 p and also 
C,(p) # p. We may assume that C\(p) € p and C\(p) # p. But, then, using the 
arguments of the previous paragraph, we find that C; = Cj}. Canceling C; and C}, 
using induction and the fact that products of nonidentity disjoint cycles can never be 
identity, we find thatr = s and C; = C’ for alli. tt 


Remark 6.1.14 The proof of the above theorem is algorithmic and gives an algo- 
rithm to express a permutation as product of disjoint cycles. Thus, using the above 
propositions we can find the order of any permutation. 


Example 6.1.15 Consider the permutation a given by 
1234567 
4527361 


a(l) = 4,02(1) = a(4) = 7,03(1) = a(7) = 1. Thus, C; = (147). Now 
2 ¢ {1,4, 7} and a(2) = 542072) = 3,032) = 2. Thus, = (25 3). 
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Further, 6 is the only symbol left which is fixed by a. Hence, a = Cj - Cp. Further, 
by the Proposition 5.4.11, o(a@) = [3,3] = 3. 


Example 6.1.16 The number of elements of order 6 in Ss is 20. For, an element in 
S5 is of order 6 if and only if it is product of two disjoint cycles one of length 3 and 
the other of length 2. Once a cycle of length 3 is picked up there is a unique cycle of 
length 2 which is disjoint from the selected cycle of length 3. Thus, there are as many 
elements of order 6 in S5 as many distinct cycles of length 3. Further, corresponding 
to a choice {a1, @2, a3} C {1, 2,3, 4,5} of 3 elements, there are exactly 2 distinct 
cycles of length 3, namely (a;a2a03) and (a1a3a2). Thus, there are 2- 5C; = 20 
distinct cycles of length 3, and as such there are 20 elements of order 6. 


Example 6.1.17 S7 contains no elements of order 8. For, a permutation is of order 8 
if and only if it can be written as product of disjoint cycles such that the least common 
multiple of lengths of these cycles equals 8. We also observe that l.c.m of certain 
numbers equals 8 only if at least one of them is 8. Thus, we cannot have any such 
permutation in $7. Similarly, S7 contains no element of order 15, for a permutation 
of order 15 contains at least one cycle of length 5 and at least one cycle of length 3, 
or it should contain a cycle of length 15 in its decomposition as product of disjoint 
cycles. 


Definition 6.1.18 Let a and ( be two nontrivial permutations in S,,. We say that a 
and ( are of same form if 


(i) the number of cycles in the representation of a as product of disjoint cycles is 
same as that in the representation of 3 as product of disjoint cycles, and 

(ii) there is a bijective correspondence between the set of cycles used in the rep- 
resentation of a and that in the representation of (3 so that the lengths of the 
corresponding cycles are same. 


Thus, (1 2 3)(4 7)(5 6 9 8) and (4 5 2 3)(8 9)(1 6 7) are permutations of same 
form where as (1 2 3)(4 7) and (4 5 2 3)(8 9) are not of the same form. 


Theorem 6.1.19 Let a, 3 € S,. Then, a and (3 are conjugate in S, if and only if 
they have same form. 


Proof Leta = aja2...a,, where {a}, Q2,...,@,} is a set of pairwise disjoint 
cycles. Let p be a permutation. Then, 


pap = pap '- pagp™'... parp'. 
eta, (G8) ncke at =: 1,2, 29057 When, 
paip | = (p(éi)p(&)... pE,)) 


Thus, pap7! = (1/)...(,, where Bi = (p(€,)p(é)... p(Ei,)) is a cycle of 
length n;. Note that {3,, G2, ...,} is a set of pairwise disjoint cycles. This shows 
that a and pap™! are of the same form. 
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Conversely, suppose thata = a,a2...a, and = (3)... are of the same 
form, where {a), Q2,..., a} and {(), 2, ..., 3} are sets of pairwise disjoint cycles 
with /(a@;) = 1(G;) for all i. Suppose that 


a; = (€)---§,) 


and _ 
Bi = (MN ---M,) 


Take a permutation p which takes €/ to 7, and maps the set {1,2,...n} — {€ | 1 < 
J <n;,1<i <r} biyectively to the set {1,2,...,n} — {nj |l<j<nj,1l<i< 
r}. Then, it is evident that pap~'! = 8. tt 


Following corollary is immediate from the definition of a normal subgroup. 


Corollary 6.1.20 A subgroup H of S, is normal if and only if a € H implies that 
all permutations which are of the same form as of a belongs to H. More precisely, 
normal subgroups of S, are subgroups which are unions of classes of permutations 
of same forms. tt 


Example 6.1.21 To find all normal subgroups of 53, we divide S3 into the classes of 
permutations of same forms: 

Cc; = {J}. 

C2 {(123), (132)}. 

C3 = {(12), (23), (13)}. 

Apart from $3 and Cy; = {J}, C; UC2 = {I, (123), (132)} is a subgroup of 
S3 (verify) which is therefore normal. Observe that C; (J C3 cannot be a subgroup 
because it contains 4 elements and 4 does not divide 6. Thus, we have only one 
nontrivial proper normal subgroup, viz {7, (123), (132)}. 


Example 6.1.22 To find normal subgroups of S4, we divide it into the classes of 
permutations of same forms: 


Cc; = {J}. 

Cy = {(1234), (1324), (1423), (1243), (1342), (1432)} the sets of cycles of 
length 4. 

C3 = {(123), (132), (234), (243), (134), 143), (124), (142)} the set of cycles of 
length 3. 


Cy = {(12)G4), 14)(23), (13) (24)} the set of products of two disjoint transpo- 
sition. 

C5 = {(12), 13), 14), (23), (24), (34)} the set of transpositions. 

Let H be a nontrivial proper normal subgroup of S4. Then, clearly C, C H. 
Further, H must be complete union of some of the classes C;. By the Lagrange 
theorem, | H | divides 24. The only possibilities, therefore are H = C; J)C3 Ca 
and H = C; \J Cs (itcan be checked easily that these are, indeed, subgroups). Thus, 
nontrivial proper normal subgroups of S4 are 
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{Z, (123), (132), (234), (243), (134), (143), (124), (142), (12) (34), (13) (24), (14) (23)}, 


and 
{7, (12) (34), (13)(24), (14) (23)}. 


Remark 6.1.23 The subgroup H = {1/, (12)(34), (13)(24), (14)(23)} of $4 which 
is normal in S4 is isomorphic to the Klein’s four group V4 (the map e ~ J,a ~ 
(12) (34), b ~ (13)(24), c ~ (14)(23) is an isomorphism). This subgroup is also 
denoted by V4 and called the Klein’s four group. Thus, V4 <J S4. 


Definition 6.1.24 Letn € N. Anordered set {i; < iz < --- < i,} of natural numbers 
is called a partition of n ifi; + i2 +---+i, = n. The partition of n in which each 
i; = 1 is called the trivial partition of n. Let p(1) denote the number of partitions 
of n. The function p thus obtained is called the partition function. 


Thus, p(1) = 1,forl = 1, p22) = 2,for2 =2,2=1+1, p(3) = 3,for3 = 
3,3=24+1,3=1+4+1+41,and p(4 = 5, for4=4,4=34+1,4=242,4= 
2+1+4+1and4=14+1+4+1-+41. 


Proposition 6.1.25 The number of conjugacy classes of S, is p(n). 


Proof The trivial partition of n corresponds to the trivial conjugacy class {J} consist- 
ing of the identity permutation only. Given a nontrivial partition {7; < iz <--- <i,} 
of n, there is a unique s <r such that i; = 1 for all j <s and i; > 2 for all 
j =s-+1. This partition determines uniquely a class of permutations which are 
products of r — s disjoint cycles of lengths i;+1, is;2,...i,, and conversely, every 
class of permutation determines uniquely a nontrivial partition. ft 


Example 6.1.26 V4 <1 Sq and {J, (12)(34)} < V4, but {7, (12)(34)} is not normal in 
S4. Thus, H <| K and K <j G need not imply that H 4G. 


Proposition 6.1.27 S4/V4 ~ $3. 


Proof V4 = {I, (12)(34), (13)(24), (14)(23)} is anormal subgroup of S4 and $3 = 
{7, (123), (132), (12), (13), (23)} is a subgroup of Sy such that V4) S3 = {7}. By 
the Noether isomorphism theorem, 


S3V4/ V4 © S3/(V4 (| Ss) © S3/(1} © Ss 


Also since | V4 () S3 | = 1, it follows that 


| S3Val= ere = 24. 
Thus, $3V4 = Sy. Hence, $4/V4 ~ $3. t 


Example 6.1.28 The number of distinct cycles of length r in S,, is i . a For, 
every arrangement of r elements from {1,2,...,} determines a cycle of length 
r and two cycles (i,i2...i,) and (ji j2...Jj,-) of length r are same if the arrange- 
ment ji j2...j, can be obtained by cyclically permuting i,i2...i,. Thus, 7 distinct 
arrangements define same cycles. Since there are ““ arrangements of r elements 


(n—r)! 
the result follows. 
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Proposition 6.1.29 Every cycle is product of transpositions. 

Proof (iji2...i-) = (ir) (iip-1) ... Griz). tt 
Since every permutation is product of disjoint cycles, we have 

Corollary 6.1.30 Every permutation is a product of transpositions. tt 


Remark 6.1.31 Representation of a permutation as product of transpositions is not 
unique. For example, 


(1234) = (14)(13)(12) = (14)(13)(12)(24)(24) = (14)(23)(13). 


6.2 Alternating Maps and Alternating Groups 


Let a € S,. Consider the following rational number: 


a(1) — a(2) a(1) — a(3) a(1) — a(n) ; a(2) — a(3) a(2) — a(n) a(n — 1) — a(n) 
1-2 1-3 -* l-n 2-3 - 2—n “"(—1l)-n 


The above expression in short is denoted by 


ll a(i) — a(j) 
1<i<j<n i-J ' 


Proposition 6.2.1 [],<;—j<n a = +1 forall a € Sy. 


Proof Since ais a permutation, for all pair (k, /) there is a unique pair (p,q), a(p) = 
kanda(q) = I.If p < q,thenk —/ appears once and only once in the numerator of 
the expression, and if g < p, then/ — k appears once and only once in the numerator 
of the expression. Also k — / or! — k appears once and only once in the denominator 
according as k < / or! < k. This proves the result. tt 


Definition 6.2.2. The map x from S,, to {1, —1} defined by 
= a(i) — a(j) 
x(a) a Whee, 1 | : 


is called the alternating map of degree n. 


Theorem 6.2.3. The alternating map x is a surjective homomorphism from S,, to 
{1, —1} which takes any transposition to —1. 


Proof We first show that x is a homomorphism. 
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Bi) — , 
@) = Thee oO ae 2 

a(B(i)) — a(B(j)) BW — BY) a(G(i)) — a(B(j)) 
ieee ieee = 1 mee 


: 3). 
B@ — BU) i—j apy, 


Since @ is a permutation of 1, 2, ...n, it follows that 


a(Bi)) — a(BGj)) __ 
| Pree Bw — BU) = x(a). 


This shows that y is a homomorphism. Since {1, —1} is abelian, it follows that y is 
constant on each conjugacy class. Since any two transpositions (being permutations 
of same form) are conjugate to each other, it suffices now to show that y(7) = —1, 
where T = (12). Since T(i) = i forall i => 3, we have 


2-1 2-3 2-n 1-3 1-4 l-n 
x(7T) = : sca : : he, = -l 
1-2 1-3 l-n 2-3 2-4 2—n 


Thus, x takes any transposition to —1. Clearly, it takes J to 1. tt 


Corollary 6.2.4 Let a € S,. Suppose that 
Q = 0102...0, = T1T2.--Ts, 


where o; and T; are transpositions. Then, r = s(mod2), i.e., 2 divides r — s (equiv- 
alently r and s both are simultaneously even or simultaneously odd). 


Proof From the above theorem, it follows that 


x(@) = X(71)X(02)..-X(Or) = X(T) X(T2) .-- X(T) 
Since x takes a transposition to —1, (—1)" = (—1)*. Hence, r — s is even. t 


Remark 6.2.5. From the above corollary, it follows that if we can write a permutation 
as a product of even number of transpositions, then we cannot write it as a product 
of odd number of transpositions, and if we can write it as a product of odd number of 
transpositions, then we cannot write it as a product of even number of transpositions. 


Definition 6.2.6 A permutation a is called an even permutation if it can be 
expressed as a product of even number of permutations, or equivalently x(a) = L. 
It is said to be an odd permutation if it can be expressed as a product of odd number 
of transpositions. 
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Thus, the set of all even permutations is the kernel of and we have the following: 


Corollary 6.2.7 The set of all even permutations of degree n is a normal subgroup 


Of Sp. t 


Definition 6.2.8 The group of all even permutations of degree n is denoted by Any, 
and it is called the alternating group of degree n. Thus, A, <I S;. 


By the fundamental theorem of homomorphism, we have 


Corollary 6.2.9 §,/A, ~ {1, -l}. tt 
By the Lagrange theorem, | S, | =| An |- | Sn/An |= 2 An |. Thus, we have 

the following. 

Corollary 6.2.10 | A, |= 4%. t 


Remark 6.2.11 There are u even permutations and the rest a permutations are odd. 
Product of any two even or product of any two odd permutations are even and product 
of an even and an odd permutation is odd. 


Proposition 6.2.12 A cycle of length r is even if and only if r is odd. 


Proof Leta = (ijiz...i-) be cycle of length r. Then, 
a = (ii,-)(ii--1)... Giri). 


Hence, a is product of r — | transpositions. The result now follows from the defini- 
tion. tt 


In general, we have the following: 


Proposition 6.2.13 [fa is product of cycles of lengths nj, no, ...,n,, then it is even 
if and only ifn, +nz+---+n, — 1 is even. tt 


Proposition 6.2.14 [fa € S, is of odd order, then a € Ap. 


Proof If ais product of disjoint cycles of lengths m,,m2,...,m,, then its order is 
the least common multiple of m1, m2, ...m,. This is odd if and only if each m; is 
odd. But, then each m; — 1 is even. The result follows from the above proposition.f 


Corollary 6.2.15 All odd ordered subgroups of S,, are also subgroups of Ayn. 


Proof Let H be a subgroup of odd order. Then, by the Lagrange theorem, order of 
every element (which is the order of the subgroup generated by that element) divides 
the order of the group, and so it is of odd order. From the above proposition, it follows 
that all elements of H are in A,. Thus, H is a subgroup of A,. ft 


Proposition 6.2.16 Let H be a subgroup of S, which has no subgroup of index 2 
(in particular if H is simple). Then, H © Ay. 
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Proof Suppose that H g A,. Then, HA, is a subgroup of S, which contains A, 
properly. Since there is no proper divisor of n! which is greater that a it follows by 
the Lagrange theorem that HA, = S,.By the Noether 2,,7 isomorphism theorem, 


{1,-1} © Sp/An = HA,/An © H/H() An. 


This shows that H () A, is a subgroup of H of index 2. This is a contradiction to the 
supposition that H has no subgroup of index 2. ft 


Example 6.2.17 As contains no element of order 6 whereas S; contains 20 elements 
of order 6. For, an element of Ss is of order 6 if and only if it is product of two disjoint 
cycles one of length 2 and another of length 3. These permutations are odd (being 
product of 3 transpositions). Thus, there is no element of order 6 in As. It is already 
seen (Example 5.4.16) that S5 contains 20 elements of order 6. 


Example 6.2.18 Ag contains no element of order 8 whereas Sg contains 7! elements 
of order 8. For, a permutation a may be of order 8 only if in the representation of 
a as product of disjoint cycles, one of the cycle is of length 8. Thus, in Sg cycles 
of length 8 and they are the only elements of order 8. They are obviously odd and 


8 : 
aa = 7! in number. 


Example 6.2.19 Ag contains 2 - if elements of order 15. Indeed, all elements of 
order 15 in Sg (being of odd order) are in Ag. Further, an element of Sg is of order 
15 if and only if it can be written as product of two disjoint cycles, one of length 5 
and the other of length 3. For each choice of cycle of length 5, there are exactly 2 
distinct cycles of length 3 which are disjoint to the chosen cycle of length 5. Thus, 
there are 2 - ; . a elements of Ag of order 15. 


Example 6.2.20 In this example, we give an example of a group in which converse of 
Lagrange theorem is not true. Consider the group A, which is of order 12. We show 
that A4 contains no subgroup of order 6. Suppose the contrary. Let H be a subgroup 
of A, of order 6. Then, H is of index 2 (Lagrange theorem) and so it is normal in 
Aq. Since H is of even order, it contains an element of order 2. The elements of 
order 2 in Ay are (12)(34), (13)(24), (14)(23). Suppose that (12)(34) € H. Since 
H is normal (14)(23) = (123)(12)(34)(132) = (123)(12)(34)(123)~! belongs to 
H. Similarly, (13)(24) € A (in fact all elements of order 2 in Ay are conjugates in 
A4). This shows that V4 is a subgroup of H. This is a contradiction to the Lagrange 
theorem (4 does not divide 6). Thus, there is no subgroup of order 6 in Aq. 


Our next aim is to represent an abstract group as a subgroup of a permutation 
group. 

Let G be a group and X a set. A homomorphism p from G to Sym(X) is called 
a permutation representation of G on X. It is said to be faithful if p is injective 
(ker p = {e})). 

The first result on permutation representation is the following theorem of Cayley. 
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Theorem 6.2.21 (Cayley’s Theorem) Every group is isomorphic to a permutation 
(also called transformation) group (i.e., every group has a faithful representation on 
some Set.). Also a finite group of order n is isomorphic to a subgroup of S,. 


Proof Let G be a group and consider the symmetric group Sym(G) on G. Let 
g €G. Define a map L, from G to G by Lg(x) = gx. The map L, is called the 
left multiplication by g. Suppose that Ly(x;) = Lg(x2). Then, gx; = gx. By 
the cancellation law, xj = x2. Thus, L, is an injective map. Further, given any 
y €G,L,(g-'y) = y. Thus, L, is also surjective. Hence, L, € Sym(G). We have 
amap f from G to Sym(G) defined by f(g) = Lg. Suppose that f(g1) = f(g2). 
Then, Le, = Lg,, and so Ly, (e) = Lg, (e). But, then, g3 = gieé = gre = g>. 
This shows that f is injective. Next, we show that f is a homomorphism. Now 


Ff (g182)() _ Legg) (x) = (g1g2)x = 81(g2X) = Lg, (Lg, (x)) = 
f (gD Cf (g2)(x)) for all x € G. 


This shows that f(gig2) = f(gi)of(g2) for all gi, g2 € G. Thus, f is injective 
homomorphism from G to Sym(G). In turn, it follows that G is isomorphic to the 
subgroup f (G) of Sym(G). Lastly, if G is finite containing n elements, then Sym(G) 
is isomorphic to S,, and hence, in this case G is isomorphic to a subgroup of S,. # 


Remark 6.2.22 If G is a group of order p, where p is prime, then G cannot be 
isomorphic to a subgroup of S,, for any m < p. This is because p does not divide 
m!. Thus, in general, G is the smallest set in the sense of cardinality such that G is 
isomorphic to a subgroup of Sym(G). 


Corollary 6.2.23 Let G be a group of order n which has no subgroup of index 2. 
Then, G is isomorphic to a subgroup of An. 


Proof From the Cayley’s theorem, we have an injective homomorphism p from G to 
S,. Consider yop, where x is the alternating homomorphism from S,, to {1, —1}. Now, 
yop cannot be surjective, for otherwise, by the fundamental theorem of homomotr- 
phism, G/ker yop is isomorphic {1, —1} and ker yop becomes a subgroup of G of 
index 2. Thus, x(p(g)) = 1 forall g € G. This means that p(g) € A, forall g € G. 
It follows that p(G) C A,, and so G is isomorphic to a subgroup of A,. tt 


Since a group of odd order has no subgroup of index 2 (Lagrange theorem), and 
so also a simple group of order > 2 we have the following two corollaries. 


Corollary 6.2.24 Every group of odd order n is isomorphic to a subgroup of An. 


Corollary 6.2.25 If G is a simple group of order n,n > 2, then G is isomorphic to 
a subgroup of An. 


Every permutation of degree 1 can be thought of as a permutation of degree m for 
each m > n which keepsn + 1,n+2,...,m fixed. Thus, S, is a subgroup of S,, in 
a natural way for all m > n. 
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Proposition 6.2.26 S,, is isomorphic to a subgroup of An+2. 


Proof Define a map ¢ from S, to Any2 by d(a) = aif ae Ay, and d(a) = 
a:-(n+1n+42) if a ¢ Ay. Clearly, ¢ is an injective homomorphism. tt 


The following corollary follows from the Cayley’s theorem and the above propo- 
sition. 


Corollary 6.2.27 Every group of order n is isomorphic to a subgroup of Ani2. tt 


Proposition 6.2.28 Let G be a group of order 21, where | is odd.Then, G has a 
subgroup of index 2 (i.e., G has subgroup of order 1). 


Proof Since G is of even order, it has an element a 4 e such that a~! = a. Thus, 
left multiplication L, by a will interchange b to ab and ab to b. Therefore, the per- 
mutation determined by L, in $2; is product of / transpositions which is odd. Hence, 
if ~ represents the injective homomorphism from G to Sz as given in the Cayley’s 
theorem, then ¢(a) is an odd permutation which is product of / transpositions. Thus, 
yow(a) = —1, where x is the alternating map. This shows that yow is a surjective 
homomorphism from G to {1, —1}. By the fundamental theorem of homomorphism, 
G/ker xow is isomorphic to {1, —1}, and hence, ker yow is a subgroup of G of 
index 2. tt 


Corollary 6.2.29 Let G be a group of order 4n+2,n > 1. Then, G cannot be 
simple. ft 


Since the number of subgroups of S,, is finite, and any group of order n is isomor- 
phic to a subgroup of S,,, we have 


Proposition 6.2.30 There are only finitely many nonisomorphic group of order n.t 
Corollary 6.2.31 There are only countably many nonisomorphic finite groups. 


Proof Since countable union of finite sets is countable, the result follows from the 
above proposition. tt 


Proposition 6.2.32 The group S, is generated by {(12), (23),...,(n — 1 n)}. 


Proof Since every permutation is product of transpositions, it suffices to show that 
(ij), i < j belongs to H, where H is the subgroup generated by {(12), (23), ..., (a — 
1 n)}. We show that (i i+ k) € AH for all k > 1. The proof is by induction on 
k. If k = 1, there is nothing to do. Assume that (¢ i+ k) € H for all i. Then, 
G@i+tk+1) = @i+V)G@+1i+1+k)@i+4+ 1 also belongs to H. The result 
follows. tt 


Proposition 6.2.33. The group S, is generated by {(12), (12...n)}. 
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Proof Using the above proposition, it is sufficient to show that (i i + 1) € H for all 
i, | <i <n, where H is the subgroup generated by {(12), (12...2)}. Now 


(12... a)(12)02...n)- = 3). 


and 


(12...n)(23)(12...n)7! = (34). 


Proceeding inductively, it follows that 
(2...n)''12)02.. 7°? = Gi+)1). 
This shows that (i+ 1) € H,andsoH = S,. t 
From the Cayley’s theorem, we have, 


Corollary 6.2.34 Every finite group can be embedded as a subgroup into a group 
which is generated by 2 elements. tt 


Proposition 6.2.35 A, is generated by cycles of length 3. 


Proof Since permutations in A, are products of even number of transpositions, it 
is sufficient to show that product of any two transpositions is a product of cycles 
of length 3. Consider (a3)(76). Itis 1 = (aBy)(ay6) if {a, B} = {7, d}. Next, 
suppose that 6 = y anda #0. Then, (af)(yd) = (aZ)(8d) = (ad). Now 
suppose that {a, 3} (\{7, 6} = @. Then, (a8)(yd) = (yad)(afy). tt 


Proposition 6.2.36 All cycles of length 3 in A,, n > 5 are conjugate to each other. 


Proof Let (a3) and (a 3’ 7) be two cycles of length 3 in A,, n > 5. We can finda 
permutation p € S,, which takes a’ to a, 3 to 3 andy toy. We may assume that p is 
an even permutation, for if not p(uv), where {u,v} (\{a’, 8, y} = G will belong 
to Ay, and it will serve the purpose. Such a pair pu, v exists, forn > 5. It is clear that 


Toph ot 


p'(aBy)p = (apy). tt 


Remark 6.2.37 The above proposition is not true forn = 4. Not all cycles of length 
3 are conjugate in Ay. For example (123) and (132) are not conjugate in A, (verify). 


Theorem 6.2.38 A,,n 4 4 is simple. 


Proof A, and Az are trivial groups whereas A3 is a cyclic group of prime order 3, 
and so simple. Assume that n > 5. Let H be a nontrivial normal subgroup of A,. 
We need to show that H = A,. Since A, is generated by cycles of length 3 (above 
proposition), it is sufficient to show that H contains all cycles of length 3. Since H 
is assumed to be normal, it contains all conjugates of its elements. Since all cycles 
of length 3 are conjugates, it is sufficient to show that H contains a cycle of length 
3. 
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Since H # {J}, it contains a nontrivial permutation. Let p be a permutation in H 
which is different from identity, and keeps maximum number of symbols fixed. We 
shall show that p is a cycle of length 3. Suppose the contrary. Then, there are two 
cases: 


(i) p is product of even number of disjoint transpositions, i.e., 


P = (af)(79).... 


(ii) In the representation of p as product of disjoint cycles, there is a cycle of length 
> 3,1e., 


Consider the case (i). Since n > 5, there exists p € {a, 3,7, 6}. Since H 
Ang = (y6p)p5p)"' = (76p)p(yps) belongs to H. Clearly, p # q for p 
takes 7 to 5 but q takes p to 6. Thus, p-'¢ ¢ H, p-'q #1. Now p”!q fixes all 
symbols fixed by p except possibly p. But p~'q(a) = aand p-'q(B) = 3. This 
means that p~'q is a nontrivial permutation in H which keeps more symbols fixed 
than p does. This is a contradiction to the choice of p. Thus, case (i) cannot occur. 

Consider the case (ii). In this case, if p is not a cycle of length 3, then it must 
change two more symbols 6, ju (say) other than a, @ and ¥, for a cycle of length 4 is 
not an even permutation. Since H < An, g = (you)p(you)! = (yp) p(ypd) 
belongs to H. Now gq is a permutation in H which fixes all symbols fixed by p. Also 
p#q, for p(3) = y and q(3) = 6. Thus, g~'p is a nontrivial permutation in 
H which fixes all symbols fixed by p and it fixes one more symbol (a) which is 
changed by p. Thus, g~!p is a nontrivial member of H which fixes more symbols 
than p does. This is a contradiction to the choice of p. Hence, p is a cycle of length 


3. t 


Since every group of order n is isomorphic to a subgroup of A,+2, we have 
following corollary. 


Corollary 6.2.39 Every finite group is isomorphic to subgroup of a finite simple 


group. 


Example 6.2.40 As has no subgroup of order 30 whereas 30 divides order of As. 
For, a subgroup of order 30, being a subgroup of index 2 (Lagrange Theorem) will 
be a proper nontrivial normal subgroup of As, a contradiction to the above theorem. 


Example 6.2.41 In this example, we find all normal subgroups of S,,. Normal sub- 
groups of $3 and S4 have already been determined. Assume that n > 5. Let H be 
a nontrivial proper normal subgroup of S,. Then, H (| A, is a normal subgroup of 
A,. Since A, is simple H() A, = A, or Hf) A, = {I}. If H(1) An = An, 
then A, C H. Since there is no proper subgroup of S, containing A, properly, 
H = A,. Next, suppose that H() A, = {J}. Since H #4 {1}, HA, = S,. Since 
H(\An = {1}, H is isomorphic to H/H () A,. By the Noether isomorphism the- 
orem, H/H () A, is isomorphic to HA,/A, = S,/An- Since S,/A, is isomorphic 
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to {1, —1}, it follows that H ~ {1, —1}. Thus, H is a subgroup of order 2. Since 
every nonidentity permutation in S, has more than one permutations of same form, 
this is impossible. 


Example 6.2.42 Let G be a group of odd order and ¢ a homomorphism from S,, 
to G,n > 5. From the above example, it follows that ker ¢ is {I} or A, or S,. If 
ker @ = {I}, then by the fundamental theorem of homomorphism, S,, is isomorphic 
to a subgroup of G. But, then, by the Lagrange theorem, n! divides the order of G. 
Since n! is even, this is not possible. Ifker ¢ = A,, then again S,,/A, is isomorphic 
to a subgroup of G. This is also impossible, for G is of odd order. Thus, ker ¢ = Sy, 
and so ¢ is the trivial map. 


S, can be considered as a subgroup of S,, for all m > n. Similarly, A, can be 
considered as a subgroup of A,, for all m > n. Thus, we get two chains 


Si SS. S--- SS S Spr ©... 


and 
A, © AQ C++» CA, SC Any ©... 


of groups. Let Soo = Ue2, Sn and Ago = Ue2, An. Then, Soo is a group which 
contains all S, as subgroups and A. is a group such that all A, are its subgroups. 
Clearly, Ago is a subgroup of S,o. Further, since S, can be considered as a subgroup 
of A, in natural manner, S$, can also be considered as a subgroup of A. In turn, 
So is isomorphic to one of its proper subgroup and also A,, is isomorphic to one of 
its proper subgroup. By the Cayley’s theorem, all finite groups are subgroups (upto 
isomorphism) of S$. and also of Ago. 


Proposition 6.2.43 A. is a simple group. 


Proof Let H be a nontrivial proper normal subgroup of Ago. Since A, C An+1 for 
alln, Ax = Uso, An for all m. Since H is nontrivial H (1) Am 4 {I} for some 
m > 5. But, then, H () A, 4 {7} for all n > m. Since H < Ajo, H () An < An Vn. 
Since A, is simple for alln >5,H()A, = A, for all n> m. But then, A, C 


H for alln => m. Hence, Ax. C H. This shows that H = Ag. tt 


Remark 6.2.44 Since Ag is anormal subgroup of S,. and Ag is simple, S$. is not 
isomorphic to Ag. 


Exercises 


6.2.1 Let f be an injective map from X to Y. Let g € Sym(X). Define a map 
E(f)(g) from ¥ to ¥ by E(f)(g)(y) = flex) if y = f(x) and y if y¢ 
Ff (X). Show that E(f)(g) € Sym(Y). Show further that E(f) defines an injective 
homomorphism from Sym(X) to Sym(Y). Show also that if g is another injective 
map from Y to Z, then E(gof) = E(g)oE(f). 
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6.2.2 Find the product a3 of permutations 
_ (123456 
“= \352614 
B= 123456 
~ \231654 


6.2.3 Express the following permutations as product of disjoint cycles and find their 
orders. Determine which of them are even and which of them are odd. 


_ (1234567 
“= 14326715 
g = (123456789 
=~ 1945763821 


_ (123456789 1011 
Y=\469810251117 3 


and 


5 — (12345678 
— (25431786 


6.2.4 Express the permutations in Exercise 6.2.3 as product of transpositions. 


6.2.5 Show that $); contains no elements of order 35. Find the number of elements 
of order 35 in Sj. Show that they are all in Aj. Show that they are all conjugates. 
Find the number of elements in $2 and also in Ajz which commute with a given 
permutation of order 35. 


6.2.6 Show that S;}5 contains no element of order 16 whereas Sj¢ contains several 
elements of order 16. Find the number of elements of order 16 in Sj¢. Does Aj¢ 
contain an element of order 16. Find the number of elements in $,g which commute 
with a given element of order 16. 


6.2.7 Find the number of elements of order 55 in Sj¢. 
6.2.8 Find the number of permutations in $,) commuting with a cycle of length 5. 


6.2.9 Find the number of even permutations in S, which commute with a cycle of 
length r. 


6.2.10 Let H be a subgroup of S, all of whose nonidentity permutations are odd. 
Show that | H |= 1 or 2. 
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6.2.11 Find the number of conjugacy classes of A, and also of As. 
6.2.12 Do A, and S, contain same number of conjugacy classes? Support. 
6.2.13 Is every conjugacy class in A, also a conjugacy class in S,,? Support. 


6.2.14 Show that S|, contains an element of order 18. Find the number of such 
elements. 


6.2.15 Is the converse of Lagrange theorem true in Ag? Support. 
6.2.16 Find the number of elements of order 21 in Ajo. 

6.2.17 Show that As and in fact A, > 5 is generated by 2 elements. 
6.2.18 Show that A, is generated by 2 elements. 

6.2.19 Show that the center Z(S,) = {I},n > 3. 

6.2.20 Show that Z(A,) = {7} foralln > 4. 


6.2.21 Show that the group /nn(S,,) of inner automorphisms of S, is isomorphic to 
Sn. 


6.2.22 Show that the group Jnn(A,),n > 41s isomorphic to A,. 


6.2.23 Let a € S, be cycle of length n and 7 a transposition in S,,. Show that S,, is 
generated by {a, T}. 


6.2.24 Find a subgroup of Sg isomorphic to Qg the Quaternion group of degree 8. 


6.2.25 Obtain a subgroup of $4 isomorphic to Z4. Can Z,4 be isomorphic to a sub- 
group of A? 


6.2.26 Let G be a group and g € G. Let R, denote the right multiplication by g 
(x ~» xg). Show that g ~~» R, defines an anti isomorphism from G to a subgroup of 
Sym(G) (a bijective map f from a group to a group is said to be an anti isomorphism 
if f(ab) = f(b) f(a) Va, b). Show also that g ~» R,-: defines an isomorphism from 
G to a subgroup of Sym(G). 


6.2.27 Find the a subgroup of Sg which is isomorphic (and also a subgroup which 
is anti isomorphic) to the dihedral group D4. 


6.2.28 Letn = nj +n2+---+n, bea partition of n. Show that S,, x Sp, +--+ x 
Sn, 1S isomorphic to a subgroup of S,,. Are subgroups corresponding to different 
partitions isomorphic? Support. 
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6.2.29 Show that V4 is the only nontrivial proper normal subgroup of A4. 
6.2.30 Show that Ag contains no element of order 180. 
6.2.31 Show that every nonabelian simple subgroup of S,, is also a subgroup of A,. 


6.2.32 Let H be anontrivial subgroup of S,, such that H g Ay. Show that H contains 
a subgroup of index 2. 


6.2.33 Find | C4, (a) |, where a is a cycle of odd length r. 


6.2.34 Show that every homomorphism from A, n > 5 to a group G is either injec- 
tive or trivial homomorphism. 


6.2.35 Show that every nontrivial endomorphism of A,,, 1 > 5 is an automorphism. 
6.2.36 Show that there are at most "’~!C,_; nonisomorphic groups of order n. 
6.2.37 Show that no group of order 4n + 2, n > 1 is simple. 


6.2.38 Show that the number of elements of order 21 in Ajo is double the number 
of elements of order 7. 


6.2.39 Give an example of an infinite simple group. 


6.2.40 Show that all finite groups are isomorphic to a subgroup of Sg and also to a 
subgroup of A... Deduce that there are countably many finite nonisomorphic groups. 
Show that S,. is isomorphic to one of its proper subgroup. 


6.2.41 Is Sym(N) © Sx? Support. 

6.2.42 Is Soo <1 Sym(N)? Support. 

6.2.43 Is Sym(N) simple? Support. 

6.2.44 Show that S$, # Ag. 

6.2.45 Show that the order of each element of A,q is finite. Show also that it contains 
elements of any finite order. 

6.2.46 Show that the additive group Q of rational numbers is isomorphic to a sub- 
group of Sym(N). 

6.2.47 Is the additive group Q of rationals isomorphic to a subgroup of $..? Support. 
6.2.48 Is the group P of roots of unity isomorphic to a subgroup of Ao? Support. 


6.2.49 Let a € S, such that it is product of m, cycles of length r;, m2 cycles of 
length r2,...,m; cycles of length r,, r; = 2 for all j. Show that the number of 
permutations which have same form as a is 
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n! 


(n — myry — morg — +++ — myr;)!- (my)Iry" + (m2) Ir5 «(mI 
Hence, find the order of Cs, (a). 
6.2.50 What is the normal subgroup of S,, generated by a transposition? 


6.2.51* Let G be a group and H a subgroup of G. A set S obtained by selecting one 
and only one member from each right coset of G modulo H with e € S is called 
a right transversal of H in G. Determine a right transversal of (i) S,_; in S,, (ii) 
S, X Ss,r+s = nin Sy, (iii) S, in Any and (iv) A,_; in A,. Determine also, if 
possible, right transversals which generate the corresponding groups. 


6.2.52 Show that S, x S;,r +s = n is a maximal subgroup of S,. Give another 
type of maximal subgroups of S,,. 


6.2.53* Show that on every nonempty set X there is a group structure. 

Hint. If X contains n elements, then there is a bijective map from X to the cyclic 
group Z,,. If X is infinite, then there is a subgroup of Sym(X) which has the same 
cardinality as X. 


6.2.54* Use the above exercise to show that there is no set containing all groups. 


6.2.55* Show that there is no set X of groups such that every group is isomorphic to 
a member of X. 


6.2.56* Show that there is no set containing all groups isomorphic to a given group. 
6.2.57 Find the centralizer of (12) in S,,. 
6.2.58 Find all normal endomorphisms of S, and also of Aj. 


6.2.59* Let p be an element of order 2 in A,, > 5. Show that there is an element 
q € A, such that {p, gq} generate A,. 


6.2.60 Determine the number of involutions in S,,, and also in A,. 


6.3 General Linear Groups 


The purpose of this section and the following section is to introduce some important 
classical matrix groups, and study some of their fundamental properties. We shall 
not discuss any of their topological properties. Indeed, many algebraic properties 
follow from topological considerations. Further properties of these groups will be 
discussed in Algebra 2 and Algebra 3. 

An xX m matrix with entries in a set X is an arrangement of n rows and m columns. 
Thus, an x m matrix A with entries in X is given by 
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411 Gi2"+ 00° aim 

421 422 "- "+ 42m 
A= ; 

Gn1 Gn2- + Anm 


where a;; € X for all i, 7. We denote this matrix by [a;;], where a;; denote the ith 
row jth column entry of the matrix, and call it a matrix of order n x m with entries 
in X.Ifn = m, then it is said to be a square matrix of order n. The set of n x m 
matrices with entries in X is denoted by M,,,,(X). The set of all square matrices of 
order n is denoted by M,,(X). 

Consider the set My (IR) of n x m matrices with entries in R. All the matrices in 
this section will have their entries in R, unless stated otherwise. Let A = [a;;] and 
B = [bj] be members of My (IR). We define the sum A + B of A and B to be the 
matrix C = [c;;], where cj; = a;; + bj; for alli, j. This defines a binary operation 
+ on My (R) such that (Mim (CR), +) is an abelian group. O,.. denotes the n x m 
matrix all of whose entries are 0. This matrix is called the zero matrix, and it is the 
additive identity of the group (M,,,, (IR), +). The negative of the matrix A = [a;;] 
is the matrix —A = [b;;], where b;; = —a;; for alli, j. 

Let A = [a,j] be an x m matrix and B = [bjx] am x p matrix with entries 
in R. The product A - B of A and B is defined to be the n x p matrix C = [cjx], 
where 

Cik = win aij d jx 


for all i, k. The following properties can be easily verified: 


(i) The matrix multiplication - is associative, i.e, (A-B)-C = A-(B-C) forall 
matrices A, B and C, where the relevant products are defined. 

(ii) Matrix multiplication - distributes over +,i.e., A-(B+C) = (A-B) + (A- 
C),and(A+ B)-C = (A-C) + (B.C) forall matrices A, B and C, where 
the relevant sums and products are defined. 


Evidently, A -On xp = Onxp, and Opxn-A = Opxm for all n x m matrices A. 
For each n, we have the square n x n matrix [,,,., all of whose diagonal entries are 
1 and all off diagonal entries are 0. The matrix J/,,,.,, is called the identity matrix of 
order n. Clearly, A+ Inxm = A = Inxn- A forall A € Mym(R). 

General linear group: The pair (/,, (IR), -) isa semigroup with identity. However, 
(M,,(R),-) is not a group, for 0,., has no inverse. A matrix A € M,,(R) is said 
to be an invertible matrix (also called a nonsingular matrix) if AB = [nxn = 
BA for some B € M,(R). Let A and B be invertible matrices with entries in R. 
Then, there is a matrix C anda matrix D such that AC = I,,, = CAandBD = 
Inxn = DB. But, then (AB)(DC) = Inyn = (DC)(AB). This shows that the 
product of any two invertible matrices is an invertible matrix. Thus, the multiplication 
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of matrices induces a multiplication on the set GL(n, R) of invertible matrices, and 
GL(n, R) is a group with respect to this multiplication. This group is called the 
General linear group. 

Diagonal subgroup: A matrix is said to be a diagonal matrix if all its off diagonal 
entries are 0. A diagonal matrix whose diagonal entries are 1, 2, ... , A; is denoted 
by Diag(\,, A2,.--, An). Observe that the effect of multiplying Diag(\), A2,..., 
An) from left to a matrix A = [a;;] is replacing the ith row of A by 4; times the ith 
row of A. More explicitly, Diag(Aj, A2,..-,An)- A = B = [bj], where bj; = 
Vidij- Similarly, A: Diag(A,, ro, aici Am) = [C = cil, where Cj = dj aij: In 
particular, Diag(Aj, A2,..-, An) + Diag (1, Ha,---,hn) = Diag(rA, M1, A2p2, 

.-;AnLn). It follows that a diagonal matrix is invertible if and only if all its 
diagonal entries are nonzero, and then the inverse of Diag(A,, A2,..-, An) is 
Di ag(\;', wee Hes bas Let D,,(R) denote the set of all diagonal matrices with 
nonzero entries. The above discussion ensures that D, (R) is a subgroup of GL(n, R) 
called the diagonal subgroup. Clearly, D, (IR) is isomorphic the direct product of n— 
copies of the multiplicative group R* of nonzero real numbers. For n > 2, the sub- 
group D,, (R) is not anormal subgroup of GL (n, R). For, consider the diagonal matrix 
Diag(A,, A2,---; An), Where A; # Az. Let Ee denote the matrix all of whose diago- 
nal entries are 1, the first row second column entry is also 1, and the rest of the entries 
are 0. Then, E ty is invertible, and its inverse is the matrix EF = all of whose diagonal 
entries are |, the first row second column entry is —1, and the rest of the entries are 
0. Check that Ee - Diag(A,, A2,--+;An)* EB is anondiagonal matrix whose first 
row second column entry is —A; + A2. Describe the normalizer of D,,(R), and also 
the centralizer of D, (IR). 

Borel subgroup B(n, R): A square n x n matrix A = [a;;] is called an upper 
triangular matrix if all below diagonal entries are 0, i.e., aj; = 0 for alli > j. Let 
B(n, R) denote the set of all invertible upper triangular matrices. Observe that the 
product of upper triangular matrices are upper triangular. Suppose that A = [a;;] 
is an invertible upper triangular matrix. Then, aj; = 0 for alli > j, and there is a 
matrix B = [b;;]suchthatB- A = I[,,,. Equating the first row first column entries 
of both sides, we obtain that b};a,; = 1.Hence,a,;; 4 Oandby,; = di # 0. Next, 
equating the second row first column entries of both sides, we get that b2;a;; = 0, 
and so bz; = 0. Again, equating the second row second column entries, we get that 
b21a12 + b72A22 = 1.Since b> = 0, bx2d422 = 1. Hence, a2 # O and bo = as, # 
0. Further, equating the third row first column entries, we obtain that b3;a;,; = 0. 
This means that b3; = 0. Now, equating the third row second column entries, we 
get that b3;a12 + b32a2. = 0. But, then b324a27 = 0. This shows that bz. = 0. 
Proceeding inductively, we find that a;; 4 0, bj, = a;; i # 0, and b;; = 0 for all 
i > j. This shows that the inverse of an invertible upper triangular matrix is again an 
invertible upper triangular matrix. Thus, B(n, IR) is a subgroup of GL(n, R). This 
subgroup and all its conjugates are called the Borel subgroups. 

Unipotent subgroup U(n, R): Let U(n,R) denote the set of all matrices in 
B(n, R) with all diagonal entries 1. It can be checked that U(n, R) is a subgroup of 
GL(n, R). This subgroup is called a unipotent subgroup of GL(n, R). Indeed, any 
conjugate of U(n, R) is called a unipotent subgroup of GL(n, R). 
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Let A be an x n matrix. The matrix obtained by interchanging the rows in to the 
corresponding columns is called the transpose of A, and it is denoted by A‘. Thus, 
if A = [a;;], then A’ = [b;;], where bj; = aj;. It is easy to observe the following: 


(i) (A+ By) = A’ + B’. 
(ii) (A- B)' = B'. A’. 
(iii) If A is invertible, then A’ is also invertible, and (A~!)' = (A‘)~!. 


An Xn matrix A is said to be a symmetric matrix if A’ = A. A matrix A is 
said to be a skew symmetric matrix if A’ = —A. 


Elementary Matrices, Permutation Matrices 


Fori ~ j and \ € R, consider the matrix E * all of whose diagonal entries are 1, the 
ithrow jthcolumn entry is 4, and the rest of the entries are 0. The matrices of the form 
E * are called transvections or shear matrices. Clearly, E Ff = Inxn + ex, where 
ef is the matrix all of whose entries are 0 except the ith row jth column entry which is 


A. Itis easy to observe that et ‘ ei, = On xn. In turn, it follows that Ey ; Ej; = as 
Consequently, E} : ES = hi = oie eee Thus, E} € GL(n, R) for all natural 
numbers i, j <n, and \ € R. Also check that E}, and E3, do not commute. Thus, 
GL(n, R) is an infinite noncommutative group for all n > 2 (note that GL(1, R) is 
the multiplicative group R* of nonzero real numbers). Clearly, for each i, j; i 4 j, 
there is an embedding j1;; of the additive group R of real numbers to GL(n, R) given 
by wij(A) = E a. The subgroup E(n, R) of GL(n, R) generated by the transvections 
is called the elementary subgroup of GL(n, R) (we shall have another interpretation 
of E(n, R)). 

Let p be a permutation of degree n. Let M,, denote the matrix obtained by permut- 
ing the rows of the identity matrix /,,,., through the permutation p, and M? denote 
the matrix obtained by permuting the columns of the identity matrix through the 
permutation p. These matrices are called the permutation matrices. It can be easily 
observed that for any matrix A = [a;;], M, - A is the matrix obtained by permuting 
the rows of the matrix A through the permutation p. In particular, M,-M, = My, 
for all permutations p and q. This defines an embedding M of the symmetric group S, 
in to the group GL(n, R) given by M(p) = M,. Consequently, using the Cayley’s 
theorem, we have the following proposition. 


Proposition 6.3.1 Every group of order n is isomorphic to a subgroup of 
GL(n, R). t 


Definition 6.3.2 A square n x n matrix A is called an elementary matrix if it is 
one of the following types: 


(i) A transvection E a 
Gi) A permutation matrix M,-, where 7 = (ij) is a transposition. 
(iii) A diagonal matrix Ds all of whose diagonal entries are | except the ith diagonal 
entry, and the ith diagonal entry is A 4 0. 
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Clearly, all the elementary matrices are invertible matrices. Indeed, the inverse of 
E} iS Ee the inverse of MV, is M, itself, and the inverse of De 1S De Hence, they 
are all members of GL(n, R). We shall show that the set of all elementary matrices 


generate GL(n, R). 


Definition 6.3.3. The following operations on matrices are called the elementary 
row (column) operations: 


(i) Add 4 times a row (column) of the matrix to another row (column) of the matrix. 
(ii) Interchange any two distinct rows (columns) of the matrix. 
(ii) Multiply a row (column) by a nonzero real number. 


Remark 6.3.4 It is easily observed that multiplying E Ee from left to a matrix A is 
equivalent to adding \ times the /th row of A to the kth row of A, and multiplying 
it from right to a matrix A is equivalent to adding X times the kth column of A to 
the /th column of A. Also, multiplying M- from left to a matrix A is equivalent to 
interchanging the kth row and /th row of A, where T is the transposition (k, /). Further, 
multiplying M, from right is equivalent to interchanging the kth and /th columns. 
Finally, multiplying the kth row (column) of A by A is equivalent to multiplying the 
elementary matrix De from left (right). 


Definition 6.3.5 Am xn matrix A = [aj;;] is said to be a matrix in reduced row 
(column) echelon form, or it is said to be a reduced row echelon matrix if the 
following hold: 


(i) The first nonzero entry in each row (column) is |. This entry is called a pivot 
entry, and the corresponding columns (rows) are called the pivot column(row) 
of the matrix. The columns (rows) which are not pivot columns (rows) are called 
free columns (rows). 

Gi) The pivot entry in any row (column) is toward right (bottom) side to the pivot 
in the previous row (column). 

(iii) All of the rest of the entries in a pivot column (row) are 0. 

(iv) All the zero rows (columns) are toward bottom (right). 


Example 6.3.6 The matrix 
12002 
00101 
00012 
00000 


is in reduced row echelon form. The Ist row Ist column, 2nd row 3rd column and 
3rd row 4th column entries are pivot entries, 2nd and 5th columns are free columns. 


Proposition 6.3.7 A reduced row (column) echelon matrix is invertible if and only 
if it is the identity matrix. 


Proof The result follows if we observe that a matrix having a zero row (column) is 
not invertible. tt 
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Proposition 6.3.8 Using elementary row/column operations, every matrix can be 
reduced to a matrix in reduced row/column echelon form. 


Proof Let Abe am x n matrix. If A is the zero matrix, then it is already in reduced 
row echelon form. Suppose that A is nonzero matrix. Let j, be the least number 
such that the j,th column is a nonzero column. Further, let i; be the smallest number 
such that aj, ;, 4 0. Interchanging the 7; th row and the first row, we may assume that 
aj, £0,anda;, = Oforallk < jj, and for alli. Multiplying the first row by ai ;, ; 


we may assume that a,;, = 1, and aj, = Oforalli andk < j,. Next, adding 
—ajj, times the first row to the ith row for each i > 2, we reduce A to a matrix [a;;], 
where aj, = Oforallk < j,; —1,a,;, = 1, anda;;, = 0 for alli > 2. If in this 


reduced matrix a;; = 0 for alli > 2 and j, then it is already in reduced row echelon 
form. If not, let jz be the smallest number such that a;;, 4 0 for some i > 2. Further, 
let iz be the smallest number greater than 2 such that aj, ;, 4 0. Note that j2 > ji. 
Interchanging the i2th row and the second row, we may assume that a2;, 4 0. Then, 
multiplying the second row by Gees we may assume that a2;, = 1. In turn, adding 
—ajj, times the second row to the ith row for each i ¢ 2, A may have been reduced 
to a matrix in reduced row echelon form. If not, proceed as before. This process 
reduces A in to reduced row echelon form after finitely many steps (if worst comes, 
at the nth step). Similarly, using elementary column operations, every matrix can be 
reduced to a matrix in reduced column echelon form. tt 


Corollary 6.3.9 LetAben x nmatrix A. Then, there is a finite sequence E,, Er, ..., 
E,, of elementary matrices such that 


Bypekigsssss Boek 


is in reduced row echelon form, and also there is a finite sequence E}, E}, wougeke 
of elementary matrices such that 


E,.En-+++-E,- A=], = AE, - E,-++-+ E, 
Proof Let A be an x n matrix. From the Remark 6.3.4 and Proposition 6.3.8, there 
is a finite sequence E), E2,..., E, of elementary matrices such that 
E,-E--:--E,-A 


is a matrix in reduced row echelon form. Since all elementary matrices are invertible, 
A is invertible if and only if 


E,+Eo+++--E,-A 
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is invertible. Again, by Proposition 6.3.7, a reduced row echelon matrix is invertible 
if and only if it is the identity matrix. Similarly, the remaining part also follows. 1 


Since the inverse of an elementary matrix is an elementary matrix, we have the 
following corollaries. 


Corollary 6.3.10 Every invertible matrix is a product of elementary matrices. Fur- 
: es sige. : >I at. 

ther, a square n x n matrix A is invertible if and only if A. X = O implies that 

X = 0, where X denote the unknown row vector and 0 is the zero row vector. 


Proof Since the inverse of an elementary matrix is an elementary matrix, the first 
assertion is immediate from the above corollary. Now, assume that A is invertible and 
A-X =0. Then, Xx =A'.0 =0. Conversely, suppose that A is singular. 
Then, the reduced column echelon form of A will have its last column as zero column 
vector o. From the above corollary, there is a finite sequence E), E2,..., E, of 
elementary matrices such that 


is a matrix in reduced column echelon form. Since the last column of this reduced 
aio Ae 
column echelon form is 0 , 


a t 

A Ey > Boxsets E,-@,' = 0, 
where @, is the nonzero vector whose all entries are 0 except the nth column entry 
which is 1. Since the elementary matrices are invertible, FE; - E.----- E,-@,' isa 
nonzero column vector. The result follows. tt 


Corollary 6.3.11 The general linear group GL(n, R) is generated by the set of 
elementary matrices. tt 


Determinant map 


Definition 6.3.12 The determinant is a map det from M,,(R) to R defined by 


n 
det(A) = Epes, signp |], apa 


where A = [a;;]. The image det (A) is called the determinant of A. 


Consider the n x m matrix A = [a;;]. This matrix can be faithfully represented 
by a column 
ry 


ln 
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of n rows, where 7; denote the ith row vector 


[aj1, 4i2, +--+, dim] 
of the matrix A. 
Similarly, the n x m matrix A = [a;;] can be faithfully represented by a row 
[ei Gay 295 Spe 1: 


of m columns, where a denote the jth column vector 


aj 
a2j 


anj 
of the matrix A. 
Proposition 6.3.13 The determinant map det from M,(R) to R satisfies the follow- 
ing properties: 


(i) det Inxn) = 1. 


(ii) det is a linear map on the rows of the matrices in the sense that 


= adet a + bdet 


for each i. 

(iii) Let A = [ajj] bean x n matrix. Let B = [bij] be the matrix obtained by 
interchanging the kth row and the Ith row of A, i.e., byj = aij, bij = aj and 
bij = aj; for alli ¢ {k,l}. Then, det(B) = —Det(A). In other words, if we 
interchange the rows of a matrix, then the determinant changes its sign. 


Proof (i) Since all off diagonal entries of the identity matrix J[,,,., is 0, for each 


nonidentity permutation p, a,j); = O for some i. Also sign of identity per- 
mutation is 1. Hence, Det(Inxn) = []j-;4ii = 1 
(ii) Let 
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1 <i <n, and fora fix k, 
Tk = [dey , pos ++ +> Ay. 


Then, 


Th 
. ’ 
= Upes, SLSNP (AM p(p— (k)) p(k) a BA y(p-14)p-1(k)) I segs Ap(ii 


. 1 . / 
= aXpes,signp |] apa ate BUpes, SiSNPA y(-1(%)) pH) Wl ee Api(i)i 


= adet| _ | + Gdet = 
rk r 


ln ln 


(iii) Let B = [b;;] be the matrix obtained by interchanging the kth row and the /th 
row of A = [a;;]. Let 7 denote the transposition (k, /). Then, 
nN n 
det(B) = Epes,signp |] pai = Eves,signp |] ano = 
nN nN 
— Upes, SiQNT P IL, 47(p(i)i = —Uges,Signg IL, Agiyi = —det(A).t 
Conversely, we shall see that the map det as given by the Definition 6.3.12 is the 
only map which satisfies the conditions stated in the Proposition 6.3.13. 


Corollary 6.3.14 Jf A = [aj;j] isan x n matrix whose two distinct rows are same, 
then det(A) = 0. 


Proof Suppose that the kth row and the /th row of A are same. Interchanging the kth 
row and /th row, the matrix remains the same, and so, from the Proposition 6.3.13, 
det(A) = —det(A). Hence, det(A) = 0. tt 
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Proposition 6.3.15 det(E-A) = det(E)det(A) for all elementary matrices E, 
and for all matrices A. In fact, 


(i) det(E;\) = |, and det(E;\ - A) = det(A) = det (E;\)det(A) for all 
transvections E be and for all matrices A. 
(ii) det(M,) = —1, and 


det(M,- A) = —det(A) = det(M,)det(A) 


for all transpositions T, and for all matrices A. 
(iii) det(Dg) = , and 


det(D;- A) = Adet(A) = det(D?)det(A) 
for all elementary matrices De, and for all matrices A. 


Proof (i) From the Proposition 6.3.13 (ii), det is a linear map on the rows of the 
matrix. Hence, 
det(E) = det(UInxn) + Adet ([bij}), 


where [b;;] is a matrix whose kth row and /th rows are same. By the Corollary 
6.3.14, det (EQ) = |. The rest of the identities in (i) follow from the same 
observation. 

(11) Follows from Proposition 6.3.13 (iii). 

(ii) This follows from the fact that the det is a linear map on rows. tt 


Proposition 6.3.16 An x n matrix A is invertible if and only if det (A) is non zero. 
Further, if Ais invertibleand A = E\E,...E,, thendet(A) = det(E,)det(E2)... 
det(E,). 


Proof Let A be an x n matrix. Then, there is a finite sequence FE), E2,..., E, of 
elementary matrices such that FE, - Ey----- E,.- A is in reduced row echelon form. 
From the above proposition, det(E,-E2-----E,-A) = det(E,)det(E)... 


det (E,)det (A). If A is a noninvertible matrix, then there is a zero row in its reduced 
row echelon form, and since the determinant of a matrix with a zero row is zero, 


it follows that det(A) = O. Further, if A is invertible, then EF, - E,-----E,- 
A = I, xn. Hence, from the Proposition 6.3.15, 1 = det(E,- E2----- E,-A) = 
det (E\)det(E2)...det(E,)det(A). This shows that det (A) 4 0. The final assertion 
also follows from the Proposition 6.3.15. tt 


The following is an immediate corollary. 
Corollary 6.3.17 GL(n, R) = {A € M,,(R) | det(A) 4 0}. tt 


Proposition 6.3.18 det(A-B) = det(A)Det(B) for all matrices A and B. 
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Proof Suppose that A (or B) is noninvertible, then both sides are zero, and so 
the equality holds. Suppose that both the matrices are invertible. Suppose that 
A=E,-E),-----E,, where E; is elementary matrix for each i. Then, det(A- 
B) = det(E,- E,.---- E, + B). Further, from the Proposition 6.3.15,det(A - B) = 
det (E\)det(E2)...det(E,)det(B). Again, from Proposition 6.3.16, det(A) = det 
(E) )det(E2) ...det(E,). This shows that det(A - B) = det(A)det(B). tt 


Corollary 6.3.19 The determinant map det is a surjective homomorphism from 
GL(n, R) to the multiplicative group R* of nonzero real numbers. 


Proof The fact that det defines a map from GL(n, R) to R* follows from the Propo- 
sition 6.3.16. The above proposition ensures that the map det is a homomorphism. 
If \ € R*, then det (D2) = . Hence, it is a surjective homomorphism. tt 


The kernel of the determinant map is the set SL(n, R) of all n x n matrices with 
determinant 1. Thus, SZ(7, R) is a normal subgroup of GL(n, R). This group is 
called the special linear group. From the fundamental theorem of homomorphism, 
we have the following corollary. 


Corollary 6.3.20 GL(n, R)/SL(n, R) is isomorphic to the multiplicative group R* 
of nonzero real numbers. ft 


Following result is an immediate consequence of the Corollary 6.3.10 and the 
Proposition 6.3.16. 


Corollary 6.3.21 Let A be an x n matrix. Then, det(A) = 0 if and only if there 
is a nonzero vector a@ such that A-a@ = 0. 


6.4 Classical Groups 


Proposition 6.4.1 Let A bean x n skew symmetric matrix which is invertible. Then, 
n is even. More explicitly, ifn = 2m + | is odd, and A is skew symmetric n x n 
matrix, then det(A) = 0. 


Proof Observe, by the definition, that det(—A) = (—1)"det(A) for anyn xn 
matrix. Suppose that A is a skew symmetric n xn matrix. Then, det(A) = 
det(A') = det(—A) = (—1)"det(A). Thus, if n is odd, then det(A) = 0. tt 


Proposition 6.4.2 Let P be a symmetric matrix (skew symmetric) in GL(n, R). 
Then, the set 
O(P) = {A€éM,(R)|A-P-A’ = P} 


is a subgroup of GL(n, R). 
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Proof Let A € O(P). Then, A: P- A’ = P. Since P is invertible, det(A)det(P) 
det(A') 4 0. Hence, det(A) # 0, and so A € GL(n, R). Let A, B € O(P). Then, 


(A-B)-P-(A-B)! = A-B-P-B'.A' = A-P-A' = P. 
This shows that A - B € O(P). Further, suppose that A € O(P). Then, A- P - A’ 


P.Intum, P = A7!.P.(A)"! = A7!.P.-(A7!). This shows that A7! 
O(P). 


aM Il 


Definition 6.4.3 Two n x n matrices P and Q are said to be congruent to each 
other if there exists a nonsingular matrix A such that APA' = Q. 


The relation of being congruent to is an equivalence relation on the set of alln x n 
matrices. 


Proposition 6.4.4 Let P and Q be nonsingular symmetric (skew symmetric) matri- 
ces in GL(n, R) which are congruent. Then, O(P) and O(Q) are conjugate in 
GL(n, R). In particular, they are isomorphic. 


Proof Let L be a nonsingular matrix such that LQL' = P. We show that L~'!O(P) 
L = O(Q).Let A € O(P). Then, APA‘ = P,andso ALQL'‘ A’ = P. This implies 
that LD-'ALOL'A'(L')~! = L7! P(L')“!. In turn, L~-'ALO(L~!AL)! = QO. This 
shows that L~'AL € O(Q). Thus, L~'O(P)L © O(Q). Similarly, LO(Q)L~! C 
O(P). Hence, L~'O(P)L = O(Q). tt 


The following proposition classifies the congruent classes of symmetric (skew 
symmetric) matrices. The proof of the proposition can be found in Algebra 2. 


Proposition 6.4.5 For each pair (p,q) of nonnegative integers with p-+q = n, 
let J(p, q) denote the n x n diagonal matrix whose first p diagonal entries are —1 
and the rest of the q diagonal entries are 1. Then, any nonsingular symmetric n x n 
matrix is congruent to a unique matrix J (p,q). Also there is a unique congruence 
class of nonsingular 2m x 2m skew symmetric matrices. tt 


Different choices of nonsingular symmetric (skew symmetric) matrices P give 
rise to different groups O(P) which play crucial roles in the respective geometries. 


Orthogonal Groups 


Orthogonal groups play a very crucial role in Euclidean and spherical geometries. 
Indeed, they are the group of isometries of the Euclidean and spherical spaces. 
Consider the symmetric matrix P = I[,,,. The group O(/,x,) is denoted by 
O(n). Thus, O(n) = {A € GL(n,R) | AA’ = I,,,,}. This group is called the 
orthogonal group. The members of O (7) are called the orthogonal matrices. If A is 
an orthogonal matrix, then (det (A))? = 1. Thus, det inducesa homomorphism from 
O(n) to the two element group {1, —1}. The diagonal matrix diag(—1, 1, ..., 1) isan 
orthogonal matrix with determinant —1. Hence, det is a surjective homomorphism 
from O(n) to {1, —1}. The kenel of this homomorphism is the normal subgroup 
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{A € O(n) | Det(A) = 1} of O(n). This subgroup is denoted by SO(n), and it 
is called the special orthogonal group (also called the rotation group). From the 
fundamental theorem of homomorphism, we get the following corollary: 


Corollary 6.4.6 O(n)/SO(n) is isomorphic to {1, —1}. tt 


> 2 > ul. 
Let X be a row vector with n columns. Then, X - X is areal number. Let 


A = [X1,X2,..., Xn] 
be an x n matrix, where xX; represent the jth column of the matrix. Then, A‘A = 
[X; - xk Thus, A is orthogonal if and only if Xe xX, = 1, and X;- x, = 
Ofori A j. Similarly, if we represent A as a column of row vectors whose ith vector 
is Rj, then AA’ = [R; - Rj |. In turn, A is orthogonal if and only if R; - R, = 1 
and R;- Rj = Ofori # j. 


Proposition 6.4.7 An x n matrix A is an orthogonal matrix if and only if (AX)! . 
AY =X-Y for all vectors X and Y. 


Proof Suppose that A is orthogonal. Then, A‘'- A = I,. Hence, 


(AX Va AY 2X AMAGY = Xie = Ko 


for all vectors X and Y. Conversely, suppose that (AX)'- AY = X ".¥ for all 
vectors X and Y. Let @ denote the vector with ith column 1 and the rest of the 
entries 0. Then, A - 2" is the ith column X;_ of the matrix A. Hence, X; - x; = 
(A-@')'-A-@;' = @-@;". Clearly, @-2@' = 1, and @-2;' = Ofori # j. It 
follows that A is orthogonal. tt 


Proposition 6.4.8 Let A bean x n orthogonal matrix, wheren = 2m + 1 is odd. 
If det(A) = +1, then there is a vector X with || X ||) = X- X = 1 such that 
ye ae ee 


Proof Suppose that A is orthogonal matrix with det(A) = +1. Then, 


det(A—+I,) = det(A')det(A—+I,) = det(A'(A—+I,)) = 
det(I, —+A‘) = det(I, —4A) = (—1)"det(A—+I,). 


Since n is odd, det(A — £I,,) = 0. From the Corollary 6.3.21, there is a nonzero 

vector Y such that (A — +/,) -Y’ = 0. This means that A-Y = +Y’. Take 
xX = —LY. 

Vr : 

Let P(n, R) denote the set of all n x n permutation matrices. Then, P(n, R) isa 

subgroup of GL(n, R) isomorphic to the symmetric group S,. Recall that the map M 

from S, to GL(n, R) given by M(p) = M, is an injective homomorphism whose 
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image is P (n, R) (by definition). For each transposition 7, M, is an orthogonal matrix 
of determinant —1. Indeed, M, is symmetric, and (M,)* = Inyn. Since P(n, R) 
is generated by the matrices of the type M,, it follows that P(n, R) is a subgroup 
of O(n). The following corollary is consequence of the Cayley’s theorem and the 
above observation. 


Corollary 6.4.9 Every finite group of order n is isomorphic to a subgroup of 
O(n). f 


Corollary 6.4.10 Let G be a finite group of order n which has no subgroup of index 
2. Then, G is isomorphic to a subgroup of SO(n). 


Proof det o M is a homomorphism from G to the two element group {1, —1}. 
Since G has no subgroup of order of index 2, (det o M)(g) = 1 forall g EG. 
Hence, det(M(g)) = 1. This shows that M is an injective homomorphism from G 
to SO(n). tt 


Example 6.4.11 Let 


be a member of O(2). Then, the condition that AA’ = 1,2 implies the following 
identities: 


ijat+bP=1=c +d’. 
(ii) ac + bd = 0. 

Indeed, then we also have 

(iii) a2 + c? = 1 = c? + d?, and 
(iv) ab + cd = 0 


Hence, for a, there is a unique angle 6,0 < 6 < m such thata = cos@.Then,b = 
+sin@.Ifb = sin@, then the conditions (ii) and (iii) imply that(c = sinOandd = 
—cos6) or(c = —sin@Oandd = cos@). Again, if b = —sin6, then the conditions 


(i1) and (iii) imply that (c = sin@andd = cos@)or(c = —sinOandd = —cos@). 
Thus, there is a unique angle 6, 0 < 0 < 7 such that 


oe cos? sind as cos@ —sin@ 
= | +sin@ zcoso | 0” = | +sin@ +cosé |" 


In particular, A € SO(2) if and only if there is a unique angle 0, 0 < @ < 7 such that 


Ae cos0 | 


tsin@ cosé 
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The correspondence A +> cos@ + isin@ gives an isomorphism from $O(2) to the 
circle group S!. 


Example 6.4.12 Consider the group SO(3). Let A € SO(3). From Proposition 
6.4.8, there is a unit vector X in the Euclidean space R? suchthat A: X = xX’. Let 
Y, Z bea pair of mutually orthogonal unit vectors in the plane orthogonal to X. More 
explicitly, X-X =Y-Y =Z-Z =landX-Y =X-Z =Y-Z =0. 
Thus, the matrix P = [x , y ; Z] is also an orthogonal matrix in O (3). Interchang- 
ing the second and the third vector, if necessary, we may suppose that P € SO(3). 
Further, 


= t 


S¥i:A Ase = Fo =O, 


Hence, Y - At is orthogonal to the vector X, and so it lies in the plane determined by 
Y and Z. Similarly, Z - A‘ also lies in the plane determined by Y and Z. Suppose 
that 


| 


-A' = aY + DZ, 


and 

Z-A' = cY + dZ. 
Since A‘ is an orthogonal matrix, Y - A’ and Z - A’ are pairwise orthogonal unit 
vectors. Hence, 


+P alae +a, (6.4.1) 


and 
ac + bd = 0. (6.4.2) 


Further, 


PisAsP-= PufAsd ALY AZ] SP le av 4 07 c¥ & dz), 


Thus, 
X-X X-(@Y + bZ)X-(c¥ + dZ) 100 
P'.A-P = Y-X Y-(aY + bZ) Var a7) = |0ac 
ZeX Zs@Y 4:07) 04Y 4 az) Obd 


Since P'’ = P™!, equating the determinant of both sides, we get that det(A) = 
ad — bc. Thus, 
ad — bc = 1 (6.4.3) 


From the Eqs. (6.4.1)-(6.4.3), we obtain a unique 6,0 <@<7 such thata = 
cos@,c = +sin0,b = +sin0,and d = cos6. Thus, A is conjugate to a unique 
matrix of the type 
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1 O 0 
0 cosO +sin®@ |, 
0 +sin@ cosé 


where 0 < 6 < =. 


Lorentz Groups 


Lorentz groups play a very crucial role in hyperbolic geometry. Indeed, they are the 
group of isometries of the hyperbolic spaces. 

Let J denote the (n + 1) x (n + 1) diagonal matrix diag(—1, 1, 1,..., 1). Then, 
J is nonsingular symmetric matrix. We denote the group O(/J) by O(1, n). Thus, 


O(1,n) = {Ae GLin+1,R)|A-J-A’ = J}. 


The group O(1, 7) is called the Lorentz orthogonal group. The members of O(1, 7) 
are called the Lorentz matrices. 

If A is a Lorentz matrix, then det(A)? = 1. Clearly, J is a Lorentz matrix whose 
determinant is —1. Thus, det induces a surjective homomorphism from O(1, 7) to 
the two element group {1, —1}. We denote the kernel of this map by SO(1, 1) and 
call it the special Lorentz group. Thus, SO(1,n) = {A € O(1,n) | det(A) = 1}. 
By the fundamental theorem of homomorphism, O(1, 2)/SO(1, 1) is isomorphic to 
the group {1, —1}. 

A Lorentz matrix A = [aj;] is called a positive Lorentz matrix if a,; > 0. It 
can be checked with a little effort that the product of any two positive Lorentz matrix 
is a positive Lorentz matrix. Thus, the set PO(1,7) of positive Lorentz matrices 
is a subgroup of O(1,n). This group is called the positive Lorentz group. Let 
PSO, n) denote the set of all positive Lorentz matrices with determinant |. Then, 
PSO(1,n) is a subgroup of PO(1,n), and it is called positive special Lorentz 
group. 


Example 6.4.13 We try to describe PSO(I, 1), POI, 1) and O(1, 1). Let 


be a member of PSO(1, 1). Then, the conditions that AJA’ = J, a> 0, and 
det(A) = 1 imply the following identities: 


i) a —-P=1=2 — a. 


(ii) ac = bd. 
(iii) ad — be = 1. 
Since a > 0, a > 1. Hence, there is a unique x > 0 such thata = coshx and 


b = sinhx.The condition (ii) implies thatc = sinhx andd = coshx. Thus, 


coshx sinhx 
sinhx coshx 


PSOU, 1) = ( ]lneR 
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Further, it can be observed that SO(1,1) = {+A]|Ae€ PSO(I, 1)} and 
o(d,1) = Sod, )U Dp, sou, 1). 
Symplectic Group 


So far, we discussed examples of the groups O(P), where P is a non singular sym- 
metric matrix. Now, we consider the case where P is a nonsingular skew symmetric 
matrix. Observe that all nonsingular 2m x 2m skew symmetric matrices are congru- 
ent to each other. Hence, we shall get the unique group upto isomorphism. Consider 


the matrix Jo, given by 
Jon a Onxn ‘a . 


—Thxn Onxn 


The group O(J2,,) is denoted by $P(2n, R), and it is called the symplectic group 
of 2n x 2n matrices. Thus, 


SP(2n,R) = {A € GL(2n,R) | AJA’ = Jon}. 


ab 
te 
be a 2 x 2 matrix. Observe that AJ>A' = J) if and only if ad — bc = 1. Thus, 
SP(2,R) = SL(2, R). However, in general, SP(2m,R) C SL(2m,R). 


Example 6.4.14 Let 


We may replace R by C or by Q in most of the discussions in this section to 
get other groups. Thus, GL(n, C) denote general linear group with entries in C and 
SL(n, C) denote the special linear group of n x n matrices with entries in C. 


Unitary Groups 
Let A = [aj] be an xn matrix with entries in C. The conjugate A of A is 
defined to be the matrix [b;;], where bj; = ajj. The matrix A’ = JZ is called 


the tranjugate or hermitian conjugate of A, and it is denoted by A*. Clearly, (i) 
(A+ B)* = A*+ B*, ii) (AB)* = B*A*, and (iii) det(A*) = det(A).Anxn 
matrix A € GL(n, C) is called a unitary matrix if A* = A7!. It is easy to observe 
that the set U(n) of all unitary matrices in GL(n, C) form a subgroup of GL(n, C). 
The group U (n) is called the unitary group. Let A € U(n). Then, 


1 = det(Inxn) = det(AA*) = det(A)det(A) =| det(A) |’. 


Thus, det induces a homomorphism from U(n) to the circle group. Clearly, diag 
(z,1,1,..., 1) € U(n) for all z € S', and then, det ((diag(z, 1,1,...,1)) = z€ 
S!. This shows that det is a surjective homomorphism fromU (n) to S!. The kernel 
of det is denoted by SU(n), and it is called the special unitary group. By the 
fundamental theorem of homomorphism, 
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U(n)/SU(n) & S?. 


Example 6.4.15 Let 
s= [a] 
be a member of SU(2). Then, AA* = hy. anddet(A) = 1. Equating the corre- 
sponding entries of the matrices, we get the following identities: 
ja? + |bP=1s[eP + | 4/, 
ac + bd = 0, andad — be = 1. 


This shows thatd = ad,andc = —b. Thus, SU (2) consists of the matrices of the 


type 
a b 
ae | cal: 


where |a [> + |b/? = 1. 
Exercises 


6.4.1 Show that {A € GL(n, R) | det(A) € Q} is anormal subgroup of GL(n, R). 
Is this the same as the subgroup GL(n, Q) of GL(n, R)? 


6.4.2 Show that GL(n, R) (SL(n, R)) can viewed as a subgroup of GL(n + 1, R) 
(SL(n + 1, R)) through the embedding i, given by 


: = A Onx1 
in (A) —_ Fe 1 |: 


6.4.3 Recall that a group G is the semi direct product K < H if there is a normal 
subgroup H anda subgroup K of G suchthatG = HK,andH()K = {e}. Prove 
the following: 


(i) Let G be a group and (3 a surjective homomorphism from G to a group K. Let 
t be ahomomorphism from K to G such that Got = Ix. Show that G is the 
semi direct product K < ker (3. 

(ii) GL(n, R) is the semi direct product R* ~ SL(n, R). 

(iii) O(n) is the semi direct product {1, —1} ~ SO(n). 
(iv) U(n) is the semi direct product $' < SU(n). 


6.4.4 Show that GL(n, R) and SL(n, R) are not finitely generated. 


6.4.5 Let A be a nonsingular matrix in GL(n, R). Show that A can be reduced to 
the diagonal matrix De e'(A) by using the elementary row and column operations of 
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the type (i) as described in the Definition 5.5.3, i.e., adding a nonzero multiple of a 
row/column to an other row/column. 
Hint. See the proof of the Proposition 5.5.8 and modify it suitably. 


6.4.6 Use the above exercise to show that every nonsingular matrix A is expressible 
as 
_ At d2 »r det(A) pri ba Hs 
A= Ey’. Eo. ...E;". Dy ae ee 


if ~ia je * ty Jr 


6.4.7 Use the above exercise to show that the group SL(n, R) is generated by the 
set of all transvections. 


6.4.8 Show that GL(n,, R) x GL (nz, R) is isomorphic to a subgroup of GL(n; + 
n2, R). 


6.4.9 Is GL(n, R) isomorphic to GL(m, R) for n 4 m? Support. 

6.4.10 Describe the centralizer of a transvection E o 

6.4.11 Describe the centralizer of M,. 

6.4.12 Describe the centralizer of De. 

6.4.13 Describe the centers of the groups GL(n, R), SL(n, R), O(n) and U(n). 
6.4.14 Find the commutator subgroups of GL(n, R), SL(n, R), and O(n). 


6.4.15 Describe the conjugacy classes of matrices in O(3) having the determinant 
-1. 


6.4.16 Interpret the members of SO(3) as rotations in R? about an axis passing 
through the origin. 


6.4.17 Interpret the matrices of determinant —1 in O(3) as reflections in IR? about 
a plane passing through origin. 


6.4.18 Describe the conjugacy classes of subgroups of SO (3). 


6.4.19 Let Ae SOCI, n), where n is even. Show that there is a nonzero vector 
X € R"™'! suchthatA-X = XxX. 


6.4.20 Describe the conjugacy classes in SO(1, 2), and also the conjugacy classes 
of subgroups of SO(1, 2). 


6.4.21 Show that every element of SU (2) is uniquely expressible as 


10 i 0 01 0i 
[oi] +a[o]+e[4o]+[%9] 


where (do, @1, 42, a3) belongs to the unit 3-sphere S?. Deduce that SU (2) is isomor- 
phic to the group S? described in Exercise 4.1.26. 


Chapter 7 
Elementary Theory of Rings and Fields 


Ring is an another important algebraic structure with two compatible binary opera- 
tions whose intrinsic presence in almost every discipline of mathematics is frequently 
noticed. The theory of rings, in the beginning, will be developed on the pattern the 
theory of groups was developed. 


7.1 Definition and Examples 


Definition 7.1.1 A ring is a triple (R,+,-), where R is a set, + and - are binary 
operations on R such that the following three conditions hold. 


1 (R, +) is an abelian group. 
2 (R,-) is a semigroup. 
3 The binary operation - distributes over + from left as well as from right. Thus, 


Gi) a-(b+c) =a-b+a-c 


Gi) (a+b)-c =a-c+b-c 
for alla,b,c ER. 


Remark 7.1.2. The condition 3 in the above definition is the compatibility condition 
between the two binary operations. 


Example 7.1.3 (Z, +, -), (Q, +, -), QR, +, -), (C, +, -) are all rings(+ and - 
are the usual addition and multiplication). 


Example 7.1.4 (Zm, ©, *) (see Chap. 3) is a ring. 


Example 7.1.5 Z[/2| = {a+b 2 | a,b € Z} is aring with respect to the addition 
and multiplication induced by those in R. We can replace 2 by any integer in this 
example and get other rings. 
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Example 7.1.6 Zi) = {a+b /-1 | a,b € Z} is a ring with respect to the 
addition and multiplication induced by the addition and the multiplication of complex 
numbers. This ring is called the ring of Gaussian integers. 


Example 7.1.7 Z[/—5] = {a+ b/—5 | a,b € Z} is also a ring with respect to 
the usual addition and multiplication induced by those in C. 


Example 7.1.8 The set M,(Z) of n x n matrices with entries in Z is a ring with 
respect to the addition + and multiplication - of matrices 


In the above example, we can replace Z by Q, R or C to get other matrix rings. 
In fact, we can replace Z by any ring to get a matrix ring M,,(R) with entries in the 
ring R. 


Example 7.1.9 Let H denote the set of all 2 x 2 matrices with entries in the field C 
of complex numbers which are of the form 


el 


where a denotes the complex conjugate of a. Then, H is a ring with respect to 
the addition and multiplication of matrices (verify). This ring is called the ring of 
Quaternions. 


Example 7.1.10 Let (M, +) be an abelian group and End(M) denote the set of all 
group endomorphisms of M. Then, End(M) is a ring with respect to the operations 
® and o, where @ and 0 are defined by 


(7 ® p)(m) = nlm) + pm), 
and 
(nop)(m) = n(p(m)) 
for all m € M (verify). 


Example 7.1.11 Every abelian group (M, +) can be made a ring by defining the 
multiplication - in M by a-b = 0. This ring is called the zero ring on M. 


Example 7.1.12 Let (Ri, ®1, *1) and (Ro, Bo, *2) be rings, and R = Rj, xX Ro. 
Define the operations © and « on R by 


(x1, 2) B® 1, ¥2) = (1 B1 y1, X2 B2 yo), 


and 
(X1,.X2) * 1, ¥2) = (1 *1 V1, X2 #2 V2). 


Then, R is aring with respect to these operations. This ring is called the directproduct 
of R, and R>. 
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Example 7.1.13 Let (R, +, -) be aring. Define operations © and * on Z x R by 
(n, a) © (m, b) = (n+m,a+b), 


and 
(n, a)*(m,b) = (nm,nb+ma-+ ab). 


Then, Z x R is a ring with respect to these operations (verify). 


Example 7.1.14 Let (R, +, -) be aring, and X be a set. Let R* denote the set of all 
mappings from X to R. Then, R* is a ring with respect to the binary operations @ 
and x defined by 

fF @ gx) = f(x) + g@) 


for all x € X, and 


(f x g(x) = f(x) +g) 


for all x € X. 


7.2 Properties of Rings 


Proposition 7.2.1 Let (R, +, -) bearing. Let 0 denote the additive identity (the zero 
of the ring) of the ring and —a be the inverse of a in (R, +). Then, 


(i) a-0=0=0-a 

(ii) a-(—b) = —(@-b) = a-(-b) 

(iii) (—a)-(—b) = a-b. 
foralla,be R. 


Proof Gi) O+a:0 = a-0 = a-(0+0) = a-0+<a-0 (by the distributive 
condition). By the cancellation law in the group (R, +), we get thata-0 = 0. 
Similarly, using the right cancellation law, we get thatO-a = Oforallae R. 

(ii) a-(—b) + a-b = a-(—b+b) (by the distributive condition) = a-0 = 0 (by 
(i)). Hence a-(—b) = —(a-b) forall a, b € R. Similarly, (—a)-b = —(a-b). 

(iii) Follows by applying (11) twice. tt 


Remark 7.2.2 In the proof of the above proposition, we have not used the commu- 
tativity of +. Only the cancellation law and the distributive condition is used. Thus, 
the above proposition is also true for an algebraic structure (R, +, -) satisfying all 
properties of a ring except (possibly) the commutativity of +. 


The following example illustrates the impact of the multiplication on the addition 
through the distributive condition. 
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Example 7.2.3 Let (R, +, -) be an algebraic structure which satisfies all the postu- 
lates of a ring except (possibly) the commutativity of the addition. Suppose that there 
is ac € R which can be left canceled in the sense that 


ca = cbimplies thata = b. 


Then, + is necessarily commutative, and so (R, +, -) becomes a ring. 


Proof Using Proposition 7.2.1(ii), and the distributivity, we have 


e-(-(@@+6)) = (-c)-(@+b) = (-e)-a + (-e)-b = 
c-(—a) +¢-(—b) = c- ((—a) + (—5)). 


Canceling c, we obtain that —(a+b) = (—a) + (—b). Taking the inverses, we get 
a+b = b+aforalla,beR. 


Remark 7.2.4. The cancellation of an element c in the above proposition is a condition 
on (R, -) which controls (through the distributive condition) the addition by making 
it commutative. Similarly, addition also has control over multiplication. 


A ring (R, +, -) is called a commutative ring ifa-b = b-aforalla,beéR. 
It is said to be with identity if there exists an element 1 4 0 inR such that 1-a = 
a = a-1 forall a € R. This element 1 is called the identity of R. Examples 
7.1.3-7.1.7 are all commutative rings with identities. Examples 7.1.8 and 7.1.9 are 
noncommutative rings with identities. The identity of M,(Z) is the identity matrix 
Inxn- It can be seen easily that E¢ - Ey # Ey Et, a £0, 6 £0, i # k. Thus, 
M,,(Z) is a noncommutative ring for n > 2. The ring RX is commutative ring if and 
only if R is commutative (verify). R* has identity if and only if R has identity(prove 
it). The set 2Z of even integers with respect to the usual addition and multiplication 
of integers is a commutative ring without identity. The ring Z x Z is a commutative 
ring with identity (1, 1) with respect to coordinate-wise addition and multiplication. 


Example 7.2.5 Let (R, +, -) be ring such that a> = a for all a € R. Then, the ring 
is commutative, anda+a = Oforallae R. 


Proof a+a = (ata)? = (at+a)-(ata) = @4+a@4+0+a = atatata 
for all a € R. By the cancellation law,a+a = Oforalla € R. Next, 


(a+b) = (a+b) = (at+b)(at+tb) = a +ab+ba+b? =a+ab+bat+b. 


By the cancellation property in (R, +),ab + ba = 0.Inturn,ab + ba = 0 = 
ab + ab. By the cancellation law, ab = ba. 
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Theorem 7.2.6 (Binomial Theorem) Let (R, +, -) be a commutative ring. Then, 
(a+ by” = a" + "Cia b +--+ "Cab +---+ b= wg "Ca" 'b". 
foralla,b€ RandneéN. 


Proof The proof is by induction on n. Ifn = 1, then there is nothing to do. Assume 
that the result is true for n. Then, 


(a+b)" = D9"Ca"'o’. 


Now, 
(a+ b)"*! = (a+b)"-(a+b) = (arg "Ca" "b")(a + b). 


By the distributive law and the commutativity of -, 
(a ae py! = ae "Ca" tp a ae AC gr tpt. 
Thus, 
(a+ by*! = qitl + = "C.a'tl'pr + E, "C._1a™ "tp 4 ptt, 
Since ”C, + "C,_; = "t!C,, we have 
(a +4 py! _— psa mC gitl—rpr 


tt 


Example 7.2.7 Let (R, +, -) be a ring with identity 1. Let a be an element of R which 
has more that one left inverse with respect to the multiplication in R. Then, it has 
infinitely many left inverses. In particular, the ring R is infinite. Thus, in any finite 
ring with identity, if left inverse (right inverse) of an element exists, then it is unique. 


Proof Let X be the set of all left inverses of a. Then, since a has more than | left 
inverse, none of them are right inverses of a. Let ag € X. Then, dp-a = 1. Ifx € X, 
then (ax—1+ag)a = axa—a+aga = a-1—a+l1 = l,andso(ax—1+do) € X. 
Thus, we have a map f from X to X defined by f(x) = ax — 1 + apo. Suppose that 
f@) = f@). Then, ax—1+ada) = ay—1+ ap. Multiplying by x from left and 
using the fact that x is a left inverse of a, we get that x = y. Thus, f is injective. We 
show that f(x) 4 ao for allx € X. If f(x) = ax—1+ado = ad, thenax = 1, and 
so x will also be a right inverse, a contradiction. It follows that f is not surjective. 
Hence, X is an infinite set. tt 
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7.3 Integral Domain, Division Ring, and Fields 


Let (R, +, -) bearing. An element a € R is called a left(right) zero divisor if there 
exists a nonzero element b in R such thatab = 0 (ba = 0). Thus, 0 is always a 
zero (left as well as right) divisor in any nontrivial ring. In Ze, 2 is a left as well as 
right zero divisor, for2*3 = 0 = 32 whereas 3 ¥ 0. In M,(Z), the matrix 


i i #j is a zero divisor, for (e4)? is the zero matrix. 


e 


Proposition 7.3.1 Let (R, +, -) be a ring. Then, the following conditions are equiv- 
alent. 

1. R has no nonzero left zero divisors. 

2. R has no nonzero right zero divisors. 

3.Ifa-b = 0, thena = Oorb = 0. 

4. If a and b are nonzero elements of R, thena-b 4 0. 

5. Restricted cancellation law holds in R in the sense that 


[a 4 0anda-b = a-c]implies thatb = c, 


and 
la £Oand b-a = c-al] implies thatb = c 


6. The multiplication - induces a binary operation in R* = R— {0} with respect 
to which R* is a semigroup with cancellation law. 


Proof 1 => 2. Assume |. Let b be a right zero divisor. Then, there is a nonzero 
element a € R such thata-b = O. But then b = 0, for otherwise a will be a 
nonzero left zero divisor. Thus, the ring has no nonzero right zero divisors. 

2 => 3. Assume 2. Suppose thata-b = 0.Ifb = 0, then there is nothing to 
do. If b ¥ 0, then since R has no nonzero right zero divisors, b cannot be a right zero 
divisor, and soa = 0. 

3 <=> 4. This equivalence is a tautology. 

4 => 5. Assume 4 (and so 3 also). Leta £ 0 anda-b = a-c. Then, 
0 = a-b—-a-c = a-b+a:-(-c) = a-(b+(-c)). Sincea £ 0, b+ (—c) = 0. 
Hence b = c. 

5 = > 4. Assume 5. Suppose thata-b = 0.Ifa = 0, then there is nothing to 
do. Ifa £0, thena-b = 0 = a-0O. By the restricted cancellation law, b = 0. 

5 = > 6. Assume 5 (and so 3 and 4 also follows). Because of 4, the multiplication 
- induces a binary operation on R* = R — {0}, and because of 5, R* is a semigroup 
with cancellation law. 

6 => |. Assume 6. If there is a nonzero left zero divisor a, then there is a nonzero 
element b such thata-b = 0. But thena, b € R*,anda-b ¢ R*. This contradicts 6. 


Definition 7.3.2 Aring (R,+,-), R 4 {0} is called anintegral domain if it satisfies 
any one (and hence all) of the six conditions in the above proposition. 


Examples 7.1.3, 7.1.5, 7.1.6, and 7.1.7 are all integral domains. 
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Proposition 7.3.3. The ring (Zm, ©, *) is an integral domain if and only if m is 
prime. 


Proof Suppose thatm = pisaprime andO0Q = @*b = ab. Then, p divides 
ab. Since p is prime, it divides a or b. But then@ = Oorb = O. Thus, Zp is 
an integral domain. Suppose that m is not prime, and som = my, - my for some 
my, Mz such that 1 < m, < mand 1 < m < m. But thenm € 0 4 mz, whereas 
mM, *™> = mm = m = O. Hence, in this case, Z,, is not an integral domain. { 


b 
0 


For example, Ze is not an integral domain, whereas Z7 is an integral domain. The 
ring M,,(Z) is not an integral domain for et, is a zero divisor. 


Example 7.3.4 Let C[0, 1] denote the set of all real-valued continuous functions 
from the closed unit interval [0, 1]. Since the sum and product of continuous functions 
are continuous, C[0, |] is a commutative ring with identity(the constant function 
which takes every member of [0, 1] to 0 is the additive identity and the constant 
function which takes every member to | is the multiplicative identity). It is not an 
integral domain: Let f be the function on [0, 1] defined by f(x) = oifx € [0, 5] 
and f(x) = (x — $)? ifx e [4, 1] and g the function given by g(x) = (x — 3) if 
x € [0, 5] and 0 at rest of the places. Then, f and g are nonzero elements of C[0, 1] 
whose product is zero function. What can you say about the ring of differentiable 
functions? What can we say about the ring of analytic functions on a domain in the 
complex plane? 


Remark 7.3.5 On every abelian group (M, +), we can define a multiplication - by 
a-b = Oforall a,b € M. Then, (M,+,-) becomes a ring(called the zero ring 
on M). Can we always define addition + on a semigroup (R, -) so that (R, +, -) 
becomes a ring? Obviously, the answer to this question is in negative. For example, 
(R, -) can never be a group. There should be an element 0 € R such thata-0 = 0 = 
0-afor all a € R. Even on such semigroup, we may not be able to define an addition 
+ so that (R, +, -) becomes a ring. The characterization of multiplicative semigroup 
of a ring may be a difficult problem. 


Proposition 7.3.6 Let (R, +, -) be a ring containing p elements, where p is a prime. 
Then, it is a zero ring, or a commutative integral domain(compare with the corre- 
sponding result in groups). 


Proof Since R contains p elements, (R, +) is a cyclic group of order p in which 
every nonzero element is a generator of the group (R, +). Suppose that (R, +, -) is 
not an integral domain. Then, we have a pair of nonzero elements a,b € R such 
thata-b = 0. Let x,y © R. Since a as well as b generate (R,+),x = na and 
y = mb for some n,m é€ {1,2,...,p— 1}. Hence, xy = na-mb = nma-b 
(by the distributive condition)= 0. Thus, if (R, +, -) is not an integral domain, then 
it is a zero ring. Further, let a € R*. Then, the group (R, +) is generated by a. Let 
x,y € R. Then,x = naandy = ma forsomen,m é€ {1,2,...,p— 1}. Clearly, 
xy = na-ma = nma? = ma-na = yx. Thus, the ring is commutative. ft 
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Let (R,+,-) be a ring. An element e € R is called an idempotent element if 
e’ = e. Anelement a € Ris called a nilpotent element if a” = 0 for somen > 0. 
If (R, +, -) is with identity 1, then an element v is called a unit (or invertible) if there 
is an element v € R such thatu-v = 1 = v-u. The set of units of R is denoted 
by U(R). An element u is called unipotent if uw — 1 is nilpotent, or equivalently if 


u = 1+ 5 for some nilpotent element b. 


Example 7.3.7 Let Y be aset and X the power set of Y. Then, X is a ring with respect 
to the addition as symmetric difference and multiplication as the intersection. Every 
element of this ring is idempotent (verify). It is a commutative ring with identity Y. 
There is no unit except the identity. There is no nilpotent element except the zero 
(i.e., @). There is no unipotent element except the identity (i.e., Y) element. It is not 
an integral domain (why?). 


Example 7.3.8 In Ze, 3 is an idempotent which is neither a unit nor a nilpotent 
element. This is also not a unipotent element (verify). 


Example 7.3.9 In the ring M,(Z) of n x n matrices with entries in Z, all strictly 
upper triangular matrices are nilpotent (verify). All upper triangular with diagonal 
entries | (such matrices are called the unitriangular) are unipotent (verify). 


Proposition 7.3.10 Let (R,+,-) be an integral domain. Let e € R, and u be a 
nonzero element of R such that eu = u = ue. Then, e is the identity. 


Proof Leta € R. Then, u- (ea—a) = ua—ua = O. Since R is an integral domain 
andu #0, ea = a. Similarly, ae = aforallae R. tt 


Corollary 7.3.11 The only nonzero idempotent element in an integral domain is the 
identity of the integral domain. tt 


Proposition 7.3.12 Let (R, +, -) be a ring with identity, and u be a unipotent element 
of the of the ring. Then, u is a unit of R. 


Proof Suppose thatu = 1 — b, where b" = O for some n € N. Then, 


(bil bb ee) = 1S bbe PAP are ay tA = 


Similarly, (1 + b+---+5b""')(1 — b) = 1. This shows that u = 1 — bisa unit, 
and its inverse is 1 +b+---+b7-!. t 


Remark 7.3.13 If a and b are nilpotent elements of R which commute with each 
other, then a + b is also nilpotent: Suppose that a” = 0 = b". Then, by the 
Binomial theorem(note that in the proof of the binomial theorem, we only need the 
fact that a and b commute), we get that (a + b)"t™ = 0. Thus, in particular, if 
u = 1-—b isa unipotent element, then its inverse 1+b-+---+ b'—! is also a 
unipotent element. Let us denote the set of unipotent elements of R by Uni(R). If 
u € Uni(R), then u~! € Uni(R). In general, product of unipotent elements need not 


1 


be unipotent. The matrices E;; and E; are unipotent in M,,(Z), but there product is 


not (verify). Clearly, Uni(R) C U(R). 
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Proposition 7.3.14 Let (R, +, -) be a ring with identity 1. The multiplication in R 
induces a multiplication in the set U(R) of units with respect to which U(R) is a 


group. 


Proof Clearly 1 € U(R). Let a,b € U(R). Then, there exist c,d € R such that 
ac = ca = bd = db = 1.But,thenabdc = ac = 1. Also,dcab = 1. Hence, 
ab € U(R). Further, if a € U(R), then there exists c € Rsuchthatac = 1 = ca. 
Clearly, c € U(R). This shows that U(R) is a group with respect to the induced 
multiplication. ft 


A maximal subgroup of U(R) consisting of unipotent elements is called a 
unipotent subgroup of U(R). The group of unitriangular matrices in M,(C) is a 
unipotent subgroup of U(M,,(C)). 


Example 7.3.15 The group U((Z, +, -)) is the cyclic group {1, —1} of order2.a € Z 
is a unit if there isa b € Z such thatab = 1. Butthen|a|-|b |= 1. Hence, 
|a|= l,andsoa = lora = —-l. 


Example 7.3.16 The group U((Zm, ®, *)) is the group U,, of prime residue classes 
modulo m: a is a unit in Z,, if there is an element b in Z,, such thata@*b = 1. But 
then ab — | is divisible by m. This means that a and m are co-prime, and soda € Uj». 


Example 7.3.17 Consider the ring Z[i] = {a+ bi | a, b € Z} of Gaussian integers. 
Then, U(Z[i]) = {1, —1, i, —i} is the cyclic group of order 4: Suppose that a + bi 
is a unit. Then, there is an element c + di € Z[i] such that (a + bi) - (c+ di) = 1. 
Taking square of the modulus of both sides, we get that a? + b* = 1. This means 
thata = +tlandb=Oorb = +l anda =O. Hence,a+bi = 1 or—1orior—i. 


Definition 7.3.18 A ring (R,+,-) is called a division ring or a skew field if the 
binary operation - induces a binary operation on R* = R— {0} with respect to which 
it is a group. A commutative division ring is called a field. 


Thus, a division ring is always an integral domain. The integral domain Z of 
integers is not a division ring, for Z* is not a group (2 has no inverse). The rings 
(Q, +, -), (R, +, -) and (C, +, -) are all fields, for (Q*, -), (R*, -), (C*, -) are 
all commutative groups. 


Proposition 7.3.19 A ring (R, +, -) with identity 1 4 0 is a division ring if and only 
if for all nonzero element a in R, there is an element a~‘in R such thata~!-a = 
l= aa", 


Proof Suppose that (R, +, -) is a division ring, and a € R*. Then, since - induces a 
binary operation on R* with respect to which it is a group, there is an element b in 
R* such thata-b = b-a = 1. Conversely, suppose that for all a € R*, there is an 
element a~! € R such thata-a~! = 1 = a! -a.Then, U(R) = R*. t 
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Example 7.3.20 The Quaternion ring (H, +, -) (Example 7.1.9) is a non commu- 
tative ring. We show that it is a division ring (which is not a field). Let 


eA 


be anonzero element of H. Then, | a |? + | b |?4 0.Letusdenote| a |? + | b /?40 
by 6. Then, it can be checked that 
; a 
56 


is the inverse of the above element, and it belongs to H. 


Ia 
| 

aS 

Ld 


Proposition 7.3.21 Every finite integral domain is a division ring. 


Proof Let (R,+, -) be a finite integral domain. Then, - induces a binary operation 
in R* with respect to which it is a finite semigroup in which cancellation law holds. 
Since a finite semigroup in which cancellation law holds is a group, R* is a group 
with respect to the induced operation. This shows that (R, +, -) is a division ring. { 


Remark 7.3.22 The finiteness assumption in the above proposition is essential, for 
(Z, +, -) is an integral domain which is not a division ring. 


The following is an immediate corollary of the above proposition. 
Corollary 7.3.23 Every finite commutative integral domain is a field. tt 


Remark 7.3.24 In fact, every finite integral domain (without the assumption of com- 
mutativity) is a field. A more general result is that a division ring in which every 
nonzero element is of finite multiplicative order is a field. The proof of this assertion 
can be found in Algebra 2. 


Corollary 7.3.25 (Zn, ®, *) is a field if and only if m is prime. 


Proof By proposition 7.3.3, (Zm, ©, *) is a finite commutative ring which is an 
integral domain if and only ifm is prime. The result follows from the above corollary. ff 


Corollary 7.3.26 A ring containing prime number of elements is either a zero ring 
or a field. 


Proof By Proposition 7.3.6, a ring containing prime number of elements is either 
a zero ring or a commutative integral domain. The result follows from the fact that 
every finite commutative integral domain is a field. ft 


Remark 7.3.27 Let (R, +, -) be aring such that (R, +) has no proper subgroup, then 
| R | is prime, and so it is a zero ring or a field. This illustrates that a condition on 
(R, +) puts a restriction on (R, -). 
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Proposition 7.3.28 Let (D,+,-) be a division ring containing q elements. Then, 
a! = aforallae D. 


Proof Ifa = _ 0, then there is nothing to do. Suppose that a 4 0. Then, a € D*, 
and since D* is a group with respect to the induced multiplication containing g — 1 
elements, it follows that a7! = 1. Thus, a? = a. tt 


Characteristic of an Integral Domain 


Proposition 7.3.29 Let (R, +, -) be an integral domain. Then, additive order of any 
two nonzero elements are same. 


Proof It is sufficient to show that ma = O ifand only ifmb = 0 foralla,b € R*. 
Let a,b € R*. Suppose that ma = 0. Then,0 = (ma)-b = a- (mb) (by the 
distributive law). Since a # 0, and R is an integral domain, mb = 0. Similarly, 
mb = Oimplies that ma = 0. tt 


Definition 7.3.30 Let (R, +, -) be an integral domain. If additive order of a nonzero 
element (and so of all nonzero elements) is infinite, then we say that R is of 
characteristic 0. If order of a nonzero element (and so of all nonzero elements) 
is finite and m, then we say that R is of characteristic m. The characteristic of R is 
denoted by charR. 


Proposition 7.3.31 The characteristic of an integral domain is either 0 or else a 
prime number. 


Proof Suppose that charR = m ¥ 0, and m is not prime. Then, m = mympz for 
some m,, m2 with 1 < m; < m,1 < m < m. Let a be a nonzero element of R. 


Since R is integral domain, a* 4 0. Hence m is additive order of a as well as a’. 


Now, 0 = ma? = mm a* = mya- moa. Since R is an integral domain, mja = 0 


or ma = 0. This is a contradiction to the supposition that m is order of a. ft 


Every division ring (field) is an integral domain, and hence, characteristic of a 
division ring or a field is zero or a prime number. 


Corollary 7.3.32 If (R, +, -) is a finite integral domain, then the characteristic of R 
is some prime p such that p divides | R |. 


Proof Since R is finite, no element can be of infinite order. Thus, characteristic of R 
is a prime p which is additive order of any nonzero element of R. Since order of an 
element divides the order of the group, p divides | R |. 


Corollary 7.3.33 Order of a finite integral domain (division ring or a field) is p" for 
some prime p andn > 0. 


Proof Since R is finite, the characteristic of R is some prime p. Thus, the order of 
each nontrivial element of (R, +) is p. No other prime g will divide the order of R, 
for otherwise, by Cauchy theorem for abelian groups, (R, +) will have an element 
of order g. Hence, | R | = p” for some n > 0. tt 
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In particular, there is no integral domain of order 6. 
Characteristics of (Z, +, -), (Q, +, -), GR, +, -), and (C, +, -) are all 0. The 
characteristic of (Z,, ®, *) is p. 


Remark 7.3.34 An infinite integral domain may also have characteristic p. An ex- 
ample will follow later. 


Proposition 7.3.35 [f (R, +, -) is a commutative integral domain of characteristic 
Dp, then (a+b)? = a?+D? foralla,be R. 


Proof We know that p divides ’C, for all r,1 < r < p—1,andso’C,c = 
o for allc € R, 1 <r <p-— 1. Applying the binomial theorem the result follows. ff 


Corollary 7.3.36 I[fR is a finite field of characteristic p, then given any b € R, there 
is an element a € R such that a? = b. 


Proof Suppose that a? = b?. Then, (a—b)? = a?—b? = 0. Since R is an integral 


domain, a— b = 0, andsoa = b. Thus, the map a ~ a? is an injective map. 
Since R is finite, it is surjective. Hence, given any b € R, there is an element a € R 
such thatb = a?. tt 


Integral Domains of Orders 4 and 8 


Let (R, +, -) be an integral domain of order 4. Since its characteristic is prime and 
divides 4, it is 2. In turn, the additive order of each nonzero element is 2. Hence, it is 
the Klein’s four group. Since it is finite, it is a division ring. Thus, R* is a cyclic group 
of order 3. This also shows that it is a field. LetR = {0, 1, a, a’}. The operations 
in R are obvious. One can check that with these operations, it is a ring. 

Let (R, +, -) be an integral domain of order 8. Again, it is of characteristic 2 
and R* is a cyclic group of order 7. Let R = {0, 1, a, a’, a3, a4, a°, a®}. The 
multiplication is clear. We try to give the addition + in R. Since R is of characteristic 
2,a+a = Oforalla € R. Consider 1+ a.1+ a € 1, for otherwise a = 0. 
1+a a, for otherwise 1 = 0. Suppose that 1+ a = a’. Then,a* = (1+a)? = 
l+a’ = 1+1+a = a. Butthena*® = 1. This is a contradiction, since 
| R* |= 7. Thus, 1 + a 4 a”. Next, suppose that 


ites a (7.3.1) 
Then, 
l+a? = (1+a)* = a°® (7.3.2) 
Ilt+a =1l+l+a=a (7.3.3) 
l+a* = (l+a’) =a® =a (7.3.4) 


lt+o = 141404 = at (7.3.5) 
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l+a° = 1414+ = a’. (7.3.6) 
Since a’ + af = al(14+a/~‘) fori < j, we can find a! + a/ for all i,j. For 
example, a + a = a(1+a’?) = ata® = a’. The addition in R, therefore, is 
given by the following table: 

+]0] 1] a la?la3la4Ja5]a%] 

0/0] 1] ala?la3la4sa5 las 

1/110 Ja3la® a laslatla? 

alala®] 0 fa4] 1 fa?/a%a5 

a*la2/a®la4] 0 fa®| a fas] 1 

ala3/ al 1 fa®| 0 fa®]a?/at 

a*la*la°la7/ a la®| 0] 1 [a>] 

@larlata®latfa21 1/0] a 

a®la®la*la5] 1 fatla3| a] 0 


The verification of the correctness of the table is left as an exercise. Also find other 
possibilities for 1 + a. 


Exercises 


7.3.1 Let (R, +, -) bearing. Show that Z x R is a ring with respect to the coordinate- 
wise addition and multiplication « given by (n, a) * (m,b) = (nm,nb+ma-+ ab). 
Show that it is a ring with identity. Find the identity. 


7.3.2 Let D[O, 1] denote the set of all real-valued differentiable functions on [0,1]. 
Show that it is a commutative ring with respect to pointwise addition and pointwise 
multiplication. Is it an integral domain? 


7.3.3 Let (R, +, -) be aring. Show that (na)-b = n(a-b) = a-(nb) foralla,bé€ 
R, andneé Z. 


7.3.4 Let M,,(R) denote the set of all n x n matrices with entries in a ring (R, +, -) 
with identity. Show that this is a ring with identity with respect to the usual matrix 
addition and matrix multiplication. Show that for n > 2, this is non commutative, 
and also it is not an integral domain. 


7.3.5 Let (R,+,-) be an integral domain. Let e € R* such that e” = e for some 
n > 1. Show that e is the identity of R. 


7.3.6 Let (Ri, +1, *1) and (Ro, +2, *2) be rings. Define operations 6 and* inR = 
R, X Rz by (a, b) @ (c,d) = (a+10¢,b+2d) and (a, b)« (c,d) = (ax, c,b*2d). 
Show that R is a ring which is commutative if and only if R; and Ry are commutative. 
It is with identity if and only if both the rings are with identities. Show also that 
U(R, x Ro) = U(R)) x U(R2). 
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7.3.7 Let (R, +, -) be a commutative ring with identity. Let Uni(R) denote the set 
of unipotent elements of R. Show that it is a subgroup of U(R). Is this result true for 
non commutative rings? 


7.3.8 Let (R, +, -) be an integral domain of order 32. Show that it is a field. Let 
a € R— {0, 1}. Show that (i) lt+a 4 a? (ii) lta x a?. Find the possible values 


of i for which l+a = a’. 


7.3.9 Do the above problem for an integral domain of order 16. 


7.3.10 Can we define multiplication on the additive group Z,4 to make it an integral 
domain? Support. 


7.3.11 Is the ring Zg an integral domain? 
7.3.12 Is R? a ring with respect to vector addition and vector product? 


7.3.13 Let X be a set containing more than | elements. Define a product - on X by 
a-b = b. Can we define addition + on X so that (X, +, -) becomes a ring? Support 
(observe that (X, -) is a semigroup). 


7.3.14 Let (R, +, -) be an structure, where (R, +) and (R, -) are groups and + dis- 
tributes over -. Show that R is a singleton. 


7.3.15 Show that every ring of order 15 is commutative. Can it be an integral domain? 


7.3.16 A formal expression of the type ag + aii + dzj + a3k, a; € R is denoted 
by a and is called a Quaternion. The above expression is written as 0 if and only if 
a; = OVi. The Quaternion aj — aji — azj — a3k is called the conjugate of a and 
is denoted by a. Also, the norm | a | is defined to be +/@ + ai + a + a, It is 


easy to see that the usual property of norm is satisfied. Let R* denote the set of all 
Quaternions. Define © and - in R* by 


a+ b= (ag +bo) + (a + bi )it (a2 + b2)j + (3 + b3)k, 
and 


a-b = (agbo — a,b, — agbz — a3b3) + (dob, + ay bo + a2b3 — a3b2)i + (agbo+ 
anby — ayb3 + a3b1)j + (aob3 + a3bo + ab — arb )k 
Show that it is a division ring (check that if a £ 0, then ia is the inverse of a). 


7.3.17 LetR = {0,e,a, b,c}. Define a binary operation: on Rby0-x = 0 = 
x:-0,e-x = x = x-e forall x € R, and the product of any two of a, b, c is the 
third. Show that there does not exist any operation + on R such that (R, +, -) isa 
ring. 


7.3.18 Show that the ring of analytic functions on a domain is an integral domain. 


7.3.19 Show that there is no set containing all rings. 
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7.4 Homomorphisms and Isomorphisms 


From now onward, the binary operations of rings will not be written unless necessary. 
They will usually be denoted by + and -. Thus, we shall say that R is a ring instead 
of saying that (R, +, -) is a ring. 


Definition 7.4.1 Let R, and R> be rings. A map f from R; to Rp» is called a 
ring homomorphism or simply a homomorphism if 


@) flat+b) = f@+f). 
(ii) f(a-b) = fla) -f). 
foralla,b€R,. 


An injective homomorphism is called a monomorphism, and a surjective ho- 
momorphism is called an epimorphism. A bijective homomorphism is called an 
isomorphism. As in case of groups, composite of homomorphisms are homomor- 
phism. Inverse of an isomorphism is an isomorphism (verify). 


Proposition 7.4.2 Let f be a homomorphism from a ring R, to a ring Ro. Then, the 
following hold: 


(i) f(O) = 0. 
(ii) f(—a) = —f(a). 
(iii) f(a—b) = f(a) —f(d). 
(iv) f(na) = nf(a) 
for alla,b € R, andne€ Z. 


Proof A ring homomorphism from R; to R2 is also a group homomorphism from 
the group (R;, +) to (Ro, +). The result follows from the corresponding results in 


groups. tt 


Remark 7.4.3, A group homomorphism from (R;, +) to (R2, +) need not be ring 
homomorphism. For example, n ~» 27 is a group homomorphism from (Z, +) to 
(Z, +) but it is not a ring homomorphism from (Z, +, -) to itself. 


We say that a ring R; is isomorphic to R» if there is an isomorphism from Rj to 
R>. The notation R; * Rp» stands to say that R; is isomorphic to R2. The relation of 
isomorphism is an equivalence relation on a set of rings. Let R; and R2 be rings. The 
map which takes every element of R; to the zero of Rz is a homomorphism called 
the zero homomorphism. The identity map Jp on a ring R is an isomorphism. 


Example 7.4.4 Let f be a ring homomorphism from the ring Z of integers to itself. 
Then, f is also a group homomorphism from (Z, +) to itself. Thus, there exists 


m € Z such that f(a) = ma for all a € Z. Since it is also a ring homomorphism, 
m-1 = fl) = f-1) = fQ)-fd@) = m-1-m-1 = m*-1, Hence, 
m> = m. This shows thatm = 0 orm = 1, and sof is the zero homomorphism 


or the identity homomorphism. 
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Example 7.4.5 The map a ~ @ is a surjective ring homomorphism from the ring Z 
of integers to the ring Z,,, of integers modulo m. 


Example 7.4.6 Let X be a set and R a ring. Consider the ring R* of functions from 
X to R. Let x € X. Define a map e, from R* toR by e,(f) = f(x). Then, e, is a 
surjective homomorphism (verify). This map is called the evaluation map at x. 


Remark 7.4.7 A ring homomorphism need not take identity to identity, even if the 
homomorphism is injective homomorphism. The ring Z x Z is a ring with identity 
(1, 1). The map a ~ (a, 0) is a ring homomorphism from Z to Z x Z. It does not 
take the identity 1 of Z to that of Z x Z. 


Proposition 7.4.8 Let f be a surjective homomorphism from a ring R, to a ring Ro. 
Let e, be the identity of R,. Then, f (e1) is the identity of Ro. 


Proof Lety € Ro. Sincef is surjective, there is anelementx € R; suchthaty = f(x). 
But then f(e;)y = fle)f@) = flex) = f() = y. Similarly, yf(e;) = 
y forall y € Ro. ft 


Proposition 7.4.9 Letf be anonzero ring homomorphism from a ring R, with iden- 
tity e, to an integral domain R>. Then, f (e,) is the identity of Ro. 


Proof Suppose that f(e;) = 0. Then, f(x) = f(xe;) = f@)f(e1) = 0. This isa 
contradiction to the supposition that f is a nonzero homomorphism. Thus, f(e;) 4 0. 
Now, f(ei)f(ei) = fleiei) = fei) = e2f(e1). By the restricted cancellation in 
Ro, fei) = e2. f 


The above proposition also shows that the only nonzero ring homomorphism from 
Z, to itself is the identity map, for it will take | to 1. 

An injective homomorphism from a ring R) to a ring Ro is called an embedding. 
If there is an embedding from R; to R2, then we say that R; is embedded in R>. 


Proposition 7.4.10 Every ring can be embedded in a ring with identity. 


Proof Let R be a ring. Consider the ring Z x R of the Example 7.1.13 which is with 
identity (1,0). Define a map f from R to Z x R by f(a) = (0,a). Then, f is an 
injective homomorphism, and so it is an embedding. tt 


Let (M, +) be an abelian group. Then, End(M) is a ring with respect to the 
pointwise addition and composition of maps. 


Proposition 7.4.11 Every ring can be embedded in a ring of endomorphisms of an 
abelian group. 


Proof Every ring can be embedded in a ring with identity, and since composite of 
embeddings are embeddings, it is sufficient to show that every ring with identity can 
be embedded in a ring of endomorphisms of an abelian group. Let R be a ring with 
identity. Consider the ring End(R, +) of endomorphisms of (R, +). Let a € R. The 
left multiplication L, from R to R given by L,(x) = ax is an element of End(R, +). 
Define a map ¢ from R to End(R, +) by ¢(a) = Lg. It is easy to check that ¢ 
is a ring homomorphism. Suppose that d(a) = ¢(b). Then, L, = Lp, and so 
a=a-1=UL,(0) = L,(1) = b-1 = Db. This shows that ¢ is an embedding. # 
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The proof of the following theorem is an imitation of the process of embedding 
the ring Z of integers into the field Q of rational numbers as described in the Sect. 3.6 
of the Chap. 3. 


Theorem 7.4.12 Every commutative integral domain can be embedded in a field. 


Proof Let R € {0} be a commutative integral domain. Define a relation ~ on R x R* 
by 
(a, b) ~ (c, d) if and only ifad = be. 


Then, ~ is an equivalence relation: The reflexivity and the symmetry follow from the 
fact that R is commutative. Suppose that (a, b) ~ (c,d) and (c,d) ~ (e, f). Then, 
ad = bcandcf = de. Hence, adf = bcf = bde.Sinced £0,af = be. Thus, 
(a, b) ~ (e,f). 

Let us denote the equivalence class determined by (a, b) by - Thus, ; = {(c,d) | 
ad = bc}. Further, 


; = < ifand only ifad = be. (7.4.1) 


Let F denote the quotient set R x R*/ ~. Thus, 


F = C |aeR, be R’}. 


Suppose that? = {and 4 = {.Then,av = buandcy = dx. But then 


(ad + be)vy = bd(uy + vx). Hence adhe — WEY (by (7.4.1). This ensures that 
we have a binary operation + on F defined by 


a c ad + be 
= : 742 
b + d bd ( ) 
Similarly, we have a binary operation - on F defined by 
ees (7.4.3) 
bd bd 


We show that (F, +, -) is a field. The verification of associativity and commutativity 


of + and -, and the distributivity of - over + is straightforward. o a ; forallb, dé 


R*. The equivalence class o is the zero of F (verify). The negative of ¢ is =* (check 


it). Also, 2 = 4 forall b,d € R*. The element is the identity of F (verify). Let 4 
be a nonzero element of F. Then, a # 0. It is easy to observe that 2 5 = gh = 4 
the identity of F. Hence, every nonzero element of F has the inverse. This completes 
the proof of the fact that F is a field. 

Define a map f from R to F by f(a) = ad d # 0. It is easy to verify that f is a 
homomorphism. Suppose that f(a) = f(b). Then, wd — bd But then ad? = bd?. 


Since d* £0, a = b. This shows that f is an embedding. ft 


The field F together with the homomorphism f introduced in the above theorem 
can be characterized by the following universal property. 
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Theorem 7.4.13 Let R, F, andf be as in the above theorem. Let F" be a field, and ¢ 
an injective homomorphism from R to F'. Then, there exists unique homomorphism 
w from F to F’ such that w of = . Indeed, if the pair (F', @) also satisfies the same 
property, then w is an isomorphism. 


Proof Since ¢ is injective, b 4 0 implies that ¢(b) 4 0. Define a binary relation ~ 
from F to F’ by 
a = 
VG) = o@oby™. 


It is straightforward verification that ~ is a map which is a homomorphism (in fact an 
injective homomorphism) with the required property 7 of = @. Next, observe that 
F(R) = {4 | ae R,d € R*} generates the field F, for? = @. 4 = f(ajf(by. 
Thus, if / is also a homomorphism with y of = ¢, thenw = w. 

Further, if (F’, @) also satisfies the same universal property, then there is a unique 
homomorphism x from F’ to F such that yod = f. Clearly, then ow is the identity 
on f(R), and so it is the identity on F’. Similarly, 7ox is the identity on F’. tt 


Definition 7.4.14 The field F described in the above theorems is called the field of 
fractions or the quotient field of R. 


Remark 7.4.15 It is clear from the above results that the field of fractions of R can 
be viewed as the smallest field containing R. 


The field of fractions of the ring Z of integers is the field Q of rational numbers. 
Interpret the field of fractions of Z[i]. The field of fractions of a field is the field 
itself. 


Exercises 

7.4.1 Find all ring homomorphisms from Z¢ to itself. 

7.4.2 Find all ring homomorphisms from Q to Q. 

7.4.3 Find all continuous ring homomorphisms from R to R and also from C to C. 
7.4.4 Find all ring homomorphisms from the ring of Gaussian integers to itself. 


7.4.5 Let f be a nonzero ring homomorphism from an integral domain R, of char- 
acteristic p to an integral domain R. Show that R> is also of characteristic p. 


7.4.6 Show that a homomorphic image of a commutative ring is commutative. 


7.4.7 Let R be a ring with identity. Define new binary operations @ and * on R by 
a@®b = a+b+landaxb = ab+a-+b. Show that (R, ®, «) is a ring, and the 
map f defined by f(a) = a— 1 is an isomorphism from (R, +, -) to (R, , *). 


7.4.8 Let F bea field and/ asurjective ring homomorphism from F toaring R # {0}. 
Show that R is also a field and f is an isomorphism. 
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7.4.9 Show that the field R of real numbers is not isomorphic to the field C of 
complex numbers. 
Hint. Look at the images of —1 and i. 


7.4.10 Show that there is no surjective homomorphism from R to C (or from C 
to R). 


7.4.11 Let F bea field of characteristic 0. Show that there is an injective homomor- 
phism from Q to F. 

Hint. Consider the map f from Z to F given by f(n) = ne, where e is the identity of 
F’. Show that f is an injective homomorphism. Observe that Q is the field of fractions 
of Z. 


7.4.12 Let F be a field of characteristic p 4 0. Show that there is an injective 
homomorphism from Z, to F. 
Hint. Show that 7 ~~ ne is the required injective homomorphism. 


7.4.13, What is the field of fractions of 2Z,? 

7.4.14 What is the field of fractions of Z[i]? 

7.4.15 Show that any two fields of order 4 are isomorphic. 
7.4.16 Show that any two fields of order 8 are isomorphic. 


7.4.17 Let F be a finite field of characteristic p. Show that the map f defined by 
f(a = a@? is an automorphism of F’. Deduce that any equation of the form x”? = a 
is solvable in F. How many solutions are there if a 4 0? 


7.4.18 Let R be a ring with identity 1 and S a subset of R such that 
@) leS. 
and 


(ii) a, b € S implies that ab € S. 
Define a relation ~ on R x S by 


(a, s) ~ (b, t) if and only if there is an element s’ € S such that 
s'(at — bs) = 


Show that ~ is an equivalence relation. Let S~'R denote the quotient set, and ¢ ~ denote 
the equivalence class determined by (a, s). Show that we have binary operations + 


and - on S~!R defined by 
a b  at+bs 


+ = 
Ss st 
and 
a b _ ab 
s ¢t 7 St 


with respect to which it is a ring. 
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7.4.19 Show that ifo € S,thenS-'R = {0}. 
7.4.20 What happens if S contains a zero divisor? 


7.4.21 Suppose that R is an integral domain and 0 ¢ S. Show that the map f given 
by f(a) = { is an embedding such that each element of f(S) is invertible. What is 
S Rus = RY? 


7.4.22 Find S~'Z, where S = {p" | n € NU{0}} and p is prime. 


7.5 Subrings, Ideals, and Isomorphism Theorems 


Definition 7.5.1 Let R be a ring. A subset S of R is called a subring if the binary 
operations + and - induce binary operations on S with respect to which it is a ring. 


Proposition 7.5.2 A subset S of a ring R is a subring if and only if 


(i) S#¥O. 
(ii) a—beSforalla,beS. 
(iii) a-beSforalla,besS. 


Proof Suppose that S is a subring. Then, + and - induce a binary operation on S with 
respect to which it is a ring. Since every ring is nonempty (contains at least 0), (i) is 
satisfied. Since - induces a binary operation, (iii) is satisfied. Since + induces a binary 
operation with respect to which S is a subgroup, (ii) follows from the corresponding 
result in group theory. 

Conversely, suppose that (7), (ii), and (iii) holds. From (i) and (ii), and the cor- 
responding result in group theory, it follows that + induces a binary operation on S 
with respect to which S is a subgroup of (R, +). Further, by (iii), - induces a binary 
operation in S. That S is a ring with respect to the induced operations is a consequence 
of the corresponding properties in the ring R. tt 


Remark 7.5.3 If S is a subring, then 0 € S and ma € S forallm € Zandae S. 


Example 7.5.4 For each m € Z, mZ is a subring of Z. Since every subgroup of 
(Z, +) is of the form mZ, it follows that every subring of (Z, +) is of the form mZ. 


Z is a subring of Q, Q is a subring of R, and R is a subring of C. 


Example 7.5.5 Consider the ring Z x Z with coordinate-wise addition and multipli- 
cation. (1, 1) is the identity. The subset Z x {0} is a subring. The identity of Z x {0} 
is (1,0) which is different from the identity of the ring. Thus, subring of a ring with 
identity may have identity which is different from the identity of the ring. 


Example 7.5.6 Subring of a ring with identity may not have identity. For example, 
2Z is a subring of Z which is without identity. 
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Example 7.5.7 A ring may be without identity, but a subring may have identity. For 
example, the ring Z x 2Z with coordinate-wise operations is a ring without identity, 
but Z x {0} is a subring with identity. 


Proposition 7.5.8 Let R be an integral domain with identity, and S be a subring of 
R which is also with identity. Then, the identities of R and S are same. 


Proof Let e be the identity of R and e’ be that of S. Then, e’-e’ = e’ = e’-e. From 
the restricted cancellation law in an integral domain, e’ = e. tt 


Example 7.5.9 Let S be a subring of the field R of real numbers. Then, S is also a 
subgroup of the additive group R. Hence, it is cyclic subgroup of R, or it is dense 
in R. Suppose that it is a nontrivial cyclic Supercup generated by a, where a is the 
smallest positive real number in S. Then, a also belongs to S, and hence, a = na 
for some n € N. Thus, ais a natural number. This shows that every non-dense subring 
of R is also a subring of Z. In particular, Z[/2] is dense in R. Also, Z is the only 


subring of R which is with identity and which is not dense. 


The proofs of the following two propositions are straightforward (similar to the 
proofs of the corresponding propositions in groups) and are left as exercises. 


Proposition 7.5.10 Intersection of a family of subrings is a subring. tt 


Proposition 7.5.11 Union of two subrings is a subring if and only if one of them is 
contained in the other. tt 


Let S; and Sj be subrings of aring R. The sum S; + Sy = {a+b|ae5S,,b€ S>} 
is a subgroup of (R, +), but, as in case of groups, it need not be a subring. Consider 
the subring 


Zn] = {ay taym + ann? +--+ ann” | a; € Z, n> 0}, 


and the subring Q of R. We show that Z[7] + Q is not a subring. We use the fact 
that 7 is a transcendental number in the following sense: 


“Thug + uya + Wot? +---+u,-n” = 0, u; €Q, thenu; = 0 for all i”. 


Clearly, 7 € Z[z] + Q and also 5 € Z[r] + Q. Suppose that 50 e€ Z[x] + Q. 
Then, 


1 
Bm = 0 + am tet Gan” +r, 


where a; € Zand r € Q. Using the transcendence of 7, we find that aq; = ‘. This 
is a contradiction. Hence, im ¢ Z[x] + Q. This shows that Z[7] + Q is nota 
subring. 
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Subring Generated by a Subset 


Let R be a ring and X a subset of R. The intersection of all subrings of R containing 
X is a subring of R, and it is the smallest subring of R containing X. This subring 
is called the subring generated by X and is denoted by < X >. The elements of 
< X > are precisely finite sums of integral multiples of finite products of nonnegative 
integral powers of elements of X. In particular, if X = {a}, then < X > denoted by 
<a> is given by 


<a>= {ao + mat ana + +--+ aa" | a; €Z, r> 0}. 


Proposition 7.5.12 Image of a subring under a homomorphism is a subring. Inverse 
image of a subring under a homomorphism is also a subring. 


Proof Let f be ahomomorphism from a ring R; to a ring R. Let S; be a subring of 
R,. Then, 0 € S},andsoO = f(0) € f(S,). Thus, f($,;) 4 @. Let f(a), f(b) € f(S1), 
where a,b € S$. Since S; is a subring, a — b € S; and alsoa-b € S;. Since f is 
a homomorphism f(a) — f(b) = f(a—b) and f(a)- f(b) = f(a-b). Hence, 
f(@ —f (6) and f(a) - f(b) both belong to f(S,). This shows that f(S,) is a subring. 
Similarly, if Sp is a subring of Rp, it can be shown that f~!(S,) is a subring of Ry. # 


Corollary 7.5.13 f—'({0}) is a subring of R,. 
Proof Since {0} is a subring of Ro, the result follows from the above proposition. 


Definition 7.5.14 f—'({0}) is called the kernel of the homomorphism f, and it is 
denoted by ker f. Thus, 


kerf = {ae R,|f(a) = O}. 


is a subring of R;. 


Since image of a subring under a homomorphism is a subring, the image of an 
embedding is also a subring. Thus, if f is an embedding of R; into Ro, then Rj 
is isomorphic to the subring f(R,) of R2. The following three results follow from 
Propositions 6.4.10, 6.4.11 and Theorem 6.4.12. 


Proposition 7.5.15 Every ring is isomorphic to a subring of a ring with identity. 
Proposition 7.5.16 Every ring is isomorphic to a ring of endomorphism. tt 


Proposition 7.5.17 Every commutative integral domain is isomorphic to a subring 


of a field. tt 


Since the images and the inverse images of subrings under homomorphisms are 
subrings, the following correspondence theorem for rings follows from the corre- 
spondence theorem for groups. 
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Theorem 7.5.18 (Correspondence Theorem) Let f be a surjective homomorphism 
froma ring R, to a ring Ro. Let S(R,) denote the set of all subrings of R, containing 
ker f and S(R2) the set of all subrings of Rz. Then, f induces a bijective map $ from 
S(Ri) to S(R2) defined by (8) = f(S). Also (SVT) = 6(S) 6). t 
Example 7.5.19 Lagrange theorem holds for finite rings: If S is a subring of a finite 
ring R, then S is a subgroup of (R, +). By the Lagrange theorem for groups, | S | 
divides | R |. Here also the converse of Lagrange theorem is not true: Let F be a field 
of order 8. We show that F has no subring of order 4. If R is subring of F of order 
4, then it being a subring of a field is an integral domain. Since a finite commutative 
integral domain is field, R is a field. But then R* is a subgroup of F* of order 3. Since 
order of F* is 7, this is a contradiction to the Lagrange theorem. 


Let A be a subring of a ring R. Then, A is a subgroup of (R, +). Consider the 
quotient group 
R/A = {x+A|x eR}. 


Since (R, +) is an abelian group, R/A is also abelian. The addition in R/A is given 
by 
@+A)+ G44) = @HY+A. 


We want to make R/A a ring. Our temptation would be to define a multiplication - 
in R/A by 
(x+A)-(WHtA) =x-y+A. (7.5.1) 


But - defined above need not be a binary operation. For example, consider R/Z. 
We have J2 + Z = 14+.724Zand V3 + Z = 14/3 + Z. But 
J2V3 +Z4(1+V2)14+ V3) + Z, for (1+ 72) + 73) — 243 is not in 
Z. 


Suppose that - defined by | is a binary operation. Then, given a € A andx € R, 
a+A = 0+A and so (a+A)-(*+A) = (0+A)(x +A). This means that 
ax +A = O0-+A. Hence ax € A. Thus, ax € A for alla € A andx € R. Also 
(x+A)(a+A) = (x+A)(0+A), andsoxa+A = 0+4A. It also follows that 
xa €Aforallae Aandx eR. 

Conversely, suppose that S is a subring of R such that ax and xa belong to A for 
alla € A and x € R. Suppose thata+A = b+Aandc+A = d-+A. Then, 
(a — b) € A and (c — d) € A. Hence, from our hypothesis, (a — b)-c € A and 
b-(c—d) € A. Since A is a subring, ac — bd = (a—b)c + b(c — d) belongs 
to A. This shows that ac+ A = bd +A, and so - defined by 1 is indeed a binary 
operation. We have proved the following theorem. 


Theorem 7.5.20 Let R bearing and A a subgroup of (R, +). Then, we have a binary 
operation - on R/A defined by 


(a+A)-(b+A) = ab+A. 


if and only if ax and xa belong to A for allx € Randa é€A. tt 
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Definition 7.5.21 A subring A of R is called a left(right) ideal if xa (ax) belongs 
to A for all x € R and for all a € A. We say that A is an ideal of R if it is both-sided 
ideal. 


Let A be an ideal of R. Then, it is easily seen that R/A is a ring with respect to the 
binary operations + and - on R/A defined by 


(x+A)+(y+A) = &+y)+A. 


and 
(x+A)-(V+A) = xy+A. 


This ring is called the difference ring or quotient ring of R modulo A. 

The map v from R to R/A defined by v(x) = x+A is asurjective homomorphism 
whose kernel is A. Thus, every ideal is a kernel of a homomorphism. The map r is 
called the quotient map modulo A. 

It can be seen as before that the image of a left ideal (right ideal) under a surjective 
homomorphism is a left (right) ideal. Also, in the correspondence theorem, ideals 
correspond. Inverse image of a left (right) ideal under a homomorphism is a left 
(right) ideal. In particular, kernel of a homomorphism is an ideal. 

As in case of groups, any subring of R/A is of the form B/A, where B is a subring 
containing A. It is a left (right) ideal if and only if B is a left (right) ideal of R. 

The results such as the fundamental theorem of homomorphism, Ist isomorphism 
theorem, and 2nd isomorphism theorem are true in case of rings also. In the corre- 
sponding results for groups, replace subgroups by subrings and normal subgroups 
by ideals to get the corresponding result for rings. The proofs are also on the same 
lines. As an illustration, we prove the 2nd isomorphism theorem. 


Theorem 7.5.22 (Noether 2nd Isomorphism theorem). Let A and B be subrings of 
R and B an ideal of R. Then, A ()\ B is an ideal of A, and A/(A (| B) © (A+ B)/B. 


Proof Since B is an ideal, A + B is a subring (verify and compare with the corre- 
sponding result in groups) and B is an ideal of A + B. Define a map f from A to 
(A+ B)/B by f(a) = a+B. Then, f is clearly a ring homomorphism. Further, any 
element of (A + B)/B is of the form a+ b+ B for some a € A and b € B. Since 
(a+b)—a = bbelongstoB,a+b+B = a+B = f(a). Thus, f is surjective 
homomorphism. Also, 


kerf ={aeEA|f@ =Bs={aeA|a+B=B}={aceA|laeB} 


Thus, kerf = A()B. Since kernel of ahomomorphism is an ideal, A () B is an ideal 
of A. By the fundamental theorem of homomorphism, A/(A (| B) is isomorphic to 
(A + B)/B. tt 


Proposition 7.5.23 A ring R is without proper left (right) ideals if and only if it is a 
zero ring of prime order, or it is a division ring. 
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Proof Suppose that R is without proper left ideals. Let a € R, a £ 0. Then, 
Ra = {xa|x € R}. 


is a left ideal of R. There are two cases: 


(i) Ra = {0} for some a 4 0. 
Gi) Ra = Rforalla € R* = R-— {0}. 


Consider the case (1). Let a 4 0 be such that Ra = {0}. Consider the left ideal 
< a > generated by a. Then, 


<a>= {na+xa|n€Zandx € R}. 


Since < a > {0} (note thata E< a >), < a >= R. Also, since Ra = 
{0}, xa = O forall x € R. Hence, R =< a>= {na | n € Z}. Since 
Ra = {0}, na-ma = nma-a = 0. Thus, R is a zero ring. But then every subgroup 
of (R, +) will be a left ideal, and since R is supposed to be without proper left ideals, 
(R, +) is without proper subgroups. Hence, in this case, R is a zero ring of prime 
order. 

Consider the case (ii). In this case Ra = R for alla 4 0. Suppose thata 4 0 F b. 
Then, Rab = (Ra)b = Rb = R. Hence, a 4 0,b £ 0 implies that ab ¢ 0. This 


shows that R is an integral domain. Let a € R,a ~ 0. Since Ra = R, there is an 
element e € R, e £0 such that ea = a. Also (ae—a)a = a —a’ = 0. Since 
a # 0, and R is an integral domain, ae = a. Let b be another nonzero element of 


R. Then, a: (eb — b) = ab—ab = 0.Sincea £0, eb = bforallb € R. Also, 
(be—b)-b = b* —b* = 0.Sinceb 40, be = b forall b € R. Hence, e is the 
identity of R. If a # 0, then Ra = R, and so there is an element a’ € R, a’ #0 
such that a’-a = e. This shows that R* is a group, and so R is a division ring. 
Conversely, we show that a zero ring of prime order or a division ring will have 
no proper left ideals. If R is zero ring of prime order, then it has no proper subgroups 
of (R, +), and so it has no proper left ideals. Next, suppose that R is a division ring. 
Let A be a nonzero left ideal. Leta € A, a & 0. Since R is a division ring, there 
is an element a’ € R such that a’-a = 1. Since A is a left ideal, 1 € A. But then 
x = x-1 belongs to A for all x € R. This shows thatA = R. Thus, R has no proper 
left ideals. The result for right ideals follows on the same lines. ft 


Since a ring with identity cannot be a zero ring, the following result is immediate 
from the above proposition. 


Corollary 7.5.24 A ring with identity is without proper left (right) ideals if and only 
if it is a division ring. tt 
Corollary 7.5.25 A commutative ring with identity is a field if and only if it is without 
proper ideals. ft 


Corollary 7.5.26 An ideal M of a commutative ring R with identity is a maximal 
ideal if and only if R/M is a field. 


244 7 Elementary Theory of Rings and Fields 


Proof Since every ideal of R/M is of the form B/M, where B is an ideal containing 
M, it follows that M is a maximal ideal if and only if R/M is without proper ideals. 
The result follows from the above corollary. tt 


Proposition 7.5.27 A ring R is without proper subrings if and only if it is a zero 
ring of prime order or a field of prime order. 


Proof If R has no proper subrings, then it has no proper left ideals also. Hence, R 
is a zero ring of prime order or a division ring. Suppose that R is a division ring, 
and | is the identity of R. The map f from Z to R defined by f(n) = nl is a ring 
homomorphism, and so f(Z) 4 {0} is a subring of R. Hence, f(Z) = R. Since R is 


a division ring and Z is not, f cannot be injective. It follows that ker f = mZ for 
some m #0, and Z/mZ = Z,, © R. Since Z,, is a division ring if and only if m is 
prime, the result follows. tt 


Compare the results with the corresponding result in groups. 


Example 7.5.28 Let F be a field. Consider the ring M2(F) of 2 x 2 matrices with 
entries in the field F. Let A be the set of all matrices whose 2nd column is the zero 
column. Then, A is a left ideal (verify). It is not a right ideal, for e;; € A, and 
€11:M2(F) = Bis the set of all matrices whose 2nd row is zero (verify). Note that 
B is aright ideal. More generally, consider the ring M,,(F) of all n x n matrices. Let 
C; denote the set of all matrices having all its columns zero except the ith column. 
Then, C; is a left ideal but not a right ideal. It may also be observed that each C; is 
minimal left ideal of M,,(F). Let R; denote the set of all matrices having all its rows 
zero except ith row. Then, R; is a (minimal) right ideal which is not a left ideal. 


Example 7.5.29 In this example, we show that the ring M2(F) of 2 x 2 matrices 
with entries in a field F is without proper two sided ideals. Let O be a nonzero two- 
sided ideal of Mz(F). Let A be a nonzero element of U. It is an elementary fact of 
matrix theory (see Algebra 2 or any book on linear algebra) that there are nonsingular 
matrices P and Q such that PAQ is the identity matrix J or e;; (depending on whether 
the rank is 2 or 1), where e; denote the matrix whose ith row and jth column entry 
is | and the rest of the entry is 0. Thus, J € U or e;; € UO. If € OU, then since O is 
an ideal, every element of Mz(F) isin UO andsoO = M2(F). Suppose that e;; € O. 
Then, €22 = €21€11€11@12 belongs to U. But then 7 = e 4; + e22 belongs to U. 
Hence, in this case, alsoU = M)(F). 


Remark 7.5.30 Propositions 7.5.23 and Corollary 7.5.24 are not true if we replace 
left ideals by ideals. 


Theorem 7.5.31 (Krull). Every proper ideal (left ideal/right ideal) of a ring with 
identity is contained in a maximal ideal(left ideal/right ideal). 


Proof Let R be a ring with identity and A an ideal (a left ideal/a right ideal) of 
R, A €R. Then, | ¢ A. Let X be the set of all ideals (left ideal/ right ideal) of R 
containing A but not 1. Then, A € X, and so X 4 W. Thus, (X, C) is a nonempty 
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partially ordered set. Let {B, | a € A} be a chain in X. Then, the union of this chain 
is also an ideal (left ideal/right ideal) containing A but not 1. Thus, every chain in 
X has an upper bound. By the Zorn’s Lemma, X has a maximal member M (say). 
Then, M is a maximal ideal (left ideal/right ideal), for if M is properly contained in 
N,thenN ¢ X. SinceA CN, 1 € N. ButthenN = R. tt 


Definition 7.5.32 Let R be a ring. An ideal go of R is called a prime ideal of R if 
R/ is an integral domain. 


Proposition 7.5.33 An ideal g of R is a prime ideal if and only if ‘ab € go implies 
thatae porbe gp’. 


Proof Suppose that go is a prime ideal. Then, R/g is an integral domain. Suppose 
that ab € 9. Then, (a+ 9)(b+ 9) = ab+p = 04+ 6 = @ (the zero of R/s9). 
Since R/ is supposed to be an integral domain,a+ g = gorb+ = @. This 
means thata € p orbe go. 

Conversely, suppose that “ab € 9 implies that a € g~ or b € &’. Further, suppose 
that (a+ )(b+9) = g (the zero of R/s9). Then, ab € g9. Hence, by the supposition, 
aéporbe g.Butthena+p = porb+p = #. tt 


Corollary 7.5.34 Every maximal ideal of a commutative ring with identity is a prime 
ideal. 


Proof Tf M is a maximal ideal of a commutative ring with identity, then R/M is a 
field, and so an integral domain. Hence, M is a prime ideal. tt 


Remark 7.5.35 The above corollary is not true for a noncommutative rings. For 
example, Mz(F), where F is a field, is a ring without proper ideals, and so {0} is the 
maximal ideal. This is not a prime ideal, for Mz(F) is not an integral domain. We 
also observe that the result is not true if the ring is without identity. For example, the 
zero ring on a prime cyclic group has {0} as maximal ideal, but it is not a prime ideal. 


Example 7.5.36 {0} is a prime ideal of Z which is not a maximal ideal. Since 
Z/[mZ = Z,» is an integral domain if and only if m is prime, it follows that mZ is a 
prime ideal if and only if m is prime. In this case, it is also a maximal ideal. 


Proposition 7.5.37 Let R be a commutative ring and A an ideal of R. Then, 
VA = {aeER|a" € A for some n > 1} 


is an ideal of R. 


Proof Clearly 0 € \/A, for 0 € A. Thus, /A 4 Y. Leta, b € V/A. Then, a”, b” € A 
forsome m,n € N. Ifr+s = n+m,thenr > nors > m. But then a’b’ € A. Since 
A is an ideal, applying the binomial theorem, we see that (a — b)"t" © A. Hence, 
(a — b) € V/A. Next, suppose that a € V/A and x € R. Then, a” € A for somen € N. 
But then (xa)"=x"a" € A. Hence, xa € V/A. This shows that \/A is an ideal of R. t 
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Definition 7.5.38 The ideal \/A is called the radical of A and \/R is called the 
nil radical of R. 


Proposition 7.5.39 Let R be a commutative ring with identity. Let A be an ideal of 
R. Then, V/A is the intersection of all prime ideals containing A. 


Proof Let x € \/A, and a prime ideal containing A. Then, x” € A C for some 
n &N. Since g is a prime ideal, and x” € go, x € go. Hence, VA C g. Thus, V/A is 
contained in the intersection of all prime ideals containing A. Suppose that x ¢ V/A. 
Then, we shall show the existence of a prime ideal so such that A C but x ¢ @. 
Consider the subset S = {x" | n € N}. Since x ¢ VA, S (\A = @. Let 


X= {B | B is an ideal of R and B( |S = U}. 


Clearly, A € X, and so X # Y. Thus, (X, C) is a nonempty partially ordered set. 
If {B, | @ € A} is a chain in X, then its union is an ideal containing A whose 
intersection with S is empty set. Thus, the union is an upper bound of the chain, and 
so every chain in X has an upper bound. By the Zorn’s lemma, X will have a maximal 
element go (say). Then, A C go and 9 (|S = @. We show that go is a prime ideal. 
Suppose that a ¢ go and b ¢ g. Then, the ideals < go U{a} > = gw + Ra and 
< p U{b} > = » + Rb do not belong to X. Thus, 9 + Raand so + Rb intersect 
S non trivially. Hence, 
x" = a+ Ba 


and 
x" = y + 6b 


forsomen,meN, a, ye pandf, 6€R. But then 
x"t™ — Béab + adb + yu + ay = Boab + yp, 


where jz € 99. Since x"t" ¢ —9, ab ¢ 69. This shows that go is a prime ideal containing 


A but not x. tt 
Example 7.5.40 Consider an ideal mZ of Z, where m = p{'p5*---p? , p; # p; for 
i #j. Then, {p;Z, poZ, ..., PrZ} is the set of distinct prime ideals containing mZ 


(verify). Hence 


VmZ = PiZ( | poZ(\--- (| paZ = Pipr-+** PnZ. 


Definition 7.5.41 Anideal A of acommutative ring R with identity is called a radical 
ideal if /A = A. 


Thus, every prime ideal is a radical ideal. The ideal 6Z of Z is a radical ideal, but 
it is not a prime ideal. It can be checked that in the correspondence theorem, prime 
ideals and radical ideals correspond. 
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Definition 7.5.42 A ring R is called a reduced ring if it is without nonzero nilpotent 
elements. 


Thus, every integral domain is a reduced ring. Z,, is a reduced ring if and only 
if m is product of distinct primes. In particular, Z)9 is a reduced ring (it is not an 
integral domain). 


Proposition 7.5.43 An ideal A of a commutative ring R with identity is a radical 
ideal if and only if R/A is a reduced ring. 


Proof Note that x” € A is equivalent to say that (x + A)" = x”+A = A. Thus, 
x € VA if and only if x+A is nilpotent in R/A. The result follows from the definition 
of a radical ideal and that of a reduced ring. tt 
Exercises 


7.5.1 Give two examples of subrings which are not ideals. 


7.5.2 Find conditions on subrings A and B so that A + B is a subring. Show that if A 
or B is an ideal, then A + B is a subring. Show that sum of any two ideals is an ideal. 


7.5.3 Find all subrings of Zi¢. Are they all ideals? 


7.5.4 Show that if S is a non-dense subring of the field R of real numbers, then it is 
of the form mZ for some m € Z. Deduce that the set {a+ bV2 | a, b € Z} is a dense 
subset of R. 


7.5.5. Characterize rings with identities which are generated by their identities. 
7.5.6 Prove the Correspondence theorem and the Ist isomorphism theorem for rings. 
7.5.7 Characterize fields in which all subrings are subfields. 


7.5.8 Show that union of chain of subrings (left ideals, ideals) is a subring (left 
ideals, ideals). 


7.5.9 Let F be a field of order p”, and R a subring of F’. Show that the order of R 
is p”, where m divides n. The converse of this is also true, and the proof of this fact 
can be found in Chap. 9 of Algebra 2. 

Hint. Observe that p” — 1 divides p” — 1 if and only if m divides n. 


7.5.10 Let R bearing. Let Z(R) = {a € R| ax = xa forall x € R}. Show that 
Z(R) is a subring of R. This subring is called the center of R. 


7.5.11 Show that the center of a division ring is a field. 
7.5.12 Show that Z(M,,(F)) is a subring isomorphic to the field F’. 


7.5.13 Let A be the subset of M,,(F) consisting of matrices all of whose columns 
are zero except the Ist column which is arbitrary. Show that A is a minimal nonzero 
left ideal of M,,(F). 
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7.5.14 For any field F', show that M,,(F’) has no proper two-sided ideal. 


7.5.15 Show that the intersection of a maximal ideal of a ring with a subring need 
not be maximal ideal of the subring, whereas it remains a prime ideal. 


7.5.16 Let R bearing, and S a subring. Let go be a prime ideal of R. Show that  () S 
is a prime ideal of S. 


7.5.17 Let R be aring, and e an idempotent element of R. Show that eRe is a subring 
of R with e as identity. 


7.5.18 Suppose that e is idempotent. Show that 1 — e is also idempotent. 


7.5.19 Let R be aring with identity. Let a, b € R be such that 1 — ab is a unit. Show 
that 1 — ba is also a unit, and 


(1 — ba)! = 14+ b(1 —ab)"!a. 
7.5.20 Show by means of an example that radical of distinct ideals may be same. 
7.5.21 Show that under the correspondence theorem radical ideals correspond. 
7.5.22 Show that radical of radical of A is radical of A. 


7.5.23 Show that the ideal generated by an element a of the ring R is {na+xa|n € 
Zand x € R}. 


7.5.24 Let R be a ring. Consider the ring R x R with coordinate-wise addition and 
multiplication. Let T be an equivalence relation on R which is a subring of R x R 
(such an equivalence relation is called a congruence). Show that 7p is an ideal. 
Conversely, suppose that B is an ideal of R. Consider the relation T on R given by 
‘T = {(a,b) € Rx R | (a—b) © B’. Show that T is a congruence such that 
To = B. 


7.5.25 Let f be a map from a ring R, to a ring Ro. Show that f is a homomorphism 
if and only if {(x, f(x)) | x © R;} is a subring of R; x Ro. 


7.5.26 Describe the notion of composition series for rings. Establish the Jordan— 
Holder theorem for rings. 


7.5.27 Describe the concept of indecomposable rings. Give some examples of in- 
decomposable rings. 


7.5.28 Discuss the Krull-Remauk—Schmidt theorem for rings. 


7.5.29 Let A be an ideal of R. Let r(A) = {x € R| xa = O forall a € A}. Show 
that r(A) is a left ideal of R. 
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7.5.30* Let C[a, b] denote the ring of all real-valued continuous functions on [a, b]. 
Let A be a proper ideal of C[a, b]. Show that there exists an element x € [a, b] such 
that all members of A vanish at x. Deduce that the maximal ideals of C[a, b] are of 
the form M@, = {f € C[a,b] | f(x) = 0}. Show that the map x ~ M, defines 
a bijective map from [a, b] to the set Max(C[a, b]) of maximal ideals. Interpret the 
induced topology on Max(C[a, b]). 


Ordered Integral Domain and Fields 


Definition 7.5.44 Let R be a commutative integral domain. An 
order structure on R is a subset P of R which satisfies the following conditions: 


G) x+yePforallx,y €P. 
(ii) x-y € Pforallx,y € P. 
(iii) For any x € R, one and only one of the following holds: 


(a) x = 0. 
(b) x EP. 
(c) —x EP. 


Thus, 0 ¢ P. The set P is called the set of positive elements of R. 


7.5.31 Let R be an ordered commutative integral domain with the order structure P 
on it. Show that 


G) LeP. 

(ii) —1 ¢P. 
(iii) Ifa 40, thena’ € P. 
(iv) fat + ai +--+ @ = 0, thena; = Oforalli. 
Deduce that the field C of complex numbers cannot be given an order structure. 
7.5.32 Show that no finite integral domain can be given an order structure. 
7.5.33 Show that every ordered integral domain is of characteristic 0. 


7.5.34 Show that N is an order structure on Z. 


7.5.35 Show that every commutative ordered integral domain contains a copy of Z. 
Deduce that Z is the smallest ordered integral domain. 


7.5.36 Show that Q is an ordered field. 


7.5.37 Let P be an order structure on a commutative integral domain R. Define a 
relation < and < on R as follows: 


x < yifand only ify —x € P, 


and 
x < y ifand only ify — x e P| _J{0}. 
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Show that < is nonreflexive, antisymmetric, and transitive. Show that (R, <) is a 
totally ordered set. 


7.5.38 Let denote the set of all Cauchy sequences in Q (see Sect.3.7 for the 
definition). Show that I is a commutative ring with identity with respect to the 
pointwise addition and multiplication. 


7.5.39 Define a map ¢ from Q to TF by ¢(r)(n) = r forall n € N. Show that ¢ is 
an embedding. 


7.5.40 Let & denote the set of all null sequences (see Sect.3.7). Show that & is an 
ideal of T’. 


7.5.41 Let f ¢ T — &. Show that there exist r € Q, r > O and ng € N such that 
|f(n) | > rforalln > no. 


7.5.42 Use the above exercise to show that 8 is a maximal ideal of I’. Deduce that 
I’/® is a field. Show that this is the field R of real numbers as introduced in Sect. 3.7. 
The field I’/8 is called the field R of real numbers. 


7.5.43 Show that the map ® from Q to R defined by ®(r) = g(r) + Nis an 
embedding. 


7.5.44 Let P denote the set 


{f +8 | there existr € Q,r >Oand ng € N such that n > 
no implies that f (n) > r}. 


Show that P is an order structure on R. 


7.5.45 Recall that a partially ordered set (X, <) is said to an order complete set if 
every nonempty set which has an upper bound has a least upper bound. Show that R 
is an order complete field with respect to the order induced by the order structure P 
on R. 


7.6 Polynomial Ring 


Let R’ be a ring with identity containing a ring R with identity as a subring. We 
assume that the identity of R is same as that of R’. We also call R’ a ring extension of 
R. Let S be a subset of R’ which commute with R element wise. Let < S > denote 
the sub-semigroup of the multiplicative semigroup R’ which contains the identity 
of R’ and which is generated by S. More explicitly, < S > denotes the set of finite 
products of nonnegative integral powers of elements of S. Then, the subring R(S) of 
R’ generated by RL) S is given by 
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{Xygecss gg | a, € Rand geé<S> withag = 
0 for all but finitely many g}. 


More generally, and more formally, let R be a ring with identity 1, and G be a 
semigroup with identity e. Let R(G) denote the set of all maps from G to R which are 
zero at all but finitely many members of G. Thus, f € R(G) means thatf is amap from 
G to R for which there is a finite subset F of G such thatf(x) = Oforallx e G—F. 
Let f, g €¢ R(G). Define a map f + g from G to R by f + g)\(@x) = f(x) + g@). If 
f() = Oforallx ¢ G—Fj, and g(x) = 0 forallx € G— Fo, where F, and F2 are 
finite sets, then F, ) F> is a finite set, and (f+ 9)(x) = Oforallx € G—(F; U Fo). 
Thus, (f + g) € R(G). This defines a binary operation + in R(G) with respect to 
which it is an abelian group (verify). Further, define a map f - g from G to R by 


ff -g)@) = Xy.zaxf (vy) g(Z). 


Observe that the sum in the right-hand side is essentially finite, for f and g both are 
zero at all but finitely many members of G. Further, if f is zero outside F,, and g is 
zero outside F, then f - g is zero outside F;} -F2 = {a-b|aeé F\,b © F}. This 
gives us another binary operation - on R(G). We show that (R(G), +, -) is aring with 
identity. 


(F- 9) MQ) = YyrarFh- MDOAZ) = 
Ly.z = x(Lyy = yf (u)g(v)) ; h(z) = LYwv)-z = af (u)g(v) A(z) = 
Yur = xf U)(2o.2 = 19h) = Lara. Af WG HO = F-(g-A))Q). 


This shows that - is associative. Similarly, we can show that - distributes over +. The 
map £ from G to R defined by €(e) = 1 and ¢(x) = 0 for all x F e is the identity. 
Thus, (R(G), +, -) is a ring with identity. This ring is called the semigroup ring of 
the ring R over the semigroup G. If G is a group, then we term it as a group ring. 

Define a map ¢ from R to R(G) by d(a)(e) = aand d(a)(x) = Oforallx #e. 
It is easy to check that ¢ is an embedding of the ring R into the semigroup ring R(G). 

Next, define a map w from G to R(G) by w(g)(g) = landw(g)(@) = 0 for all 
x & g.v can also be seen to be an embedding of the semigroup G into the semigroup 
(R(G), -). If we identify R as a subring of R(G) through ¢ and G as sub semigroup of 
R(G) through w (i.e., we identify ¢(a) with a and w(g) with g), then every element 
of R(G) is of the form 

LyeGAg9, 


where the sum is essentially a finite sum in the sense that a, = 0 for all but finitely 


many g € G (this element is the representation of the map a from G to R given by 
a(g) = ay). The addition + and the multiplication - are given by 


LgeGAgg + VyeGFy9 = YgeG(Ag + By)gs 
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and 

(XgeGQg9) : (XyeGFy9) = YgeG (Un-kag Ane) g- 
Clearly, under the identification, the semigroup G is a sub-semigroup of the multi- 
plicative semigroup of R(G), and the group U(G) of units of G is a subgroup of the 


group U(R(G)) of units of R(G). In particular, any group G is a subgroup the group 
U(R(G)). 


Example 7.6.1 Let us describe the group ring R(G) of the field IR over the group 
G = {e, g} of order 2. Clearly, 


R(G) = {ae + bg | a,b €R}. 
The addition + is given by 
(ae + bg) + (ce + dg) = (ato + (b+d)g 
and the multiplication - is given by 
(ae + bg)-(ce + dg) = (ac+bd)e + (ad+bc)g. 

le is the multiplicative identity. An element ae + bg is a unit if and only if the 
system 

aX + bY = 1 

bX + aY = 0 
of linear equations over R has a unique solution. Thus, ae + bg is a unit if and only 
if a’ — b* # O, and then, age = og is the inverse of ae + bg. The group 
U(R(G)) of units of R(G) is given by U(R(G)) = {ae+bg | a — Bb? ¢ O}. 


Observe that R(G) — U(R(G)) is the set of all zero divisors. In particular, le + lg 
is a zero divisor. 


Example 7.6.2 Let us describe the group ring Z(V4) of the ring Z over the Klein’s 
four group V4. Clearly, 


Z(V4) = {age + aja + anb + ayc | a; € Z}, 
where V; = {e, a, b, c} is the Klein’s four group. The addition + is given by 


(age + aja + arb + a3c) + (Soe + Bia + fob + betasc) = 
(ay + Boje + (a1 + Bia + (a2 + f2)b + (a3 + fs)e. 


The product - is given by 


(ape + ajya+agb+a3c) - (Goe+ Bat Pob+ betasc) = yetyat 72b+ 43¢. 
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where Yo = 0039 +013; +022 +0333, V1 = A081 +0189 + 0233 +0382, 72 = 
a 82 + A289 + 0183 + 0381, 73 = A083 + 0389 + A182 + a2). Find out the 
group U(Z(V4)) of units of Z(V4). 


Now, we study a very particular but universal semigroup ring R(G), where G is 
an infinite cyclic semigroup with identity which is generated by a symbol X. Thus, 
G =<X >= {x" | n € NU{0}} with the understanding that X” = X” if and 
only if n = m. The operation - in G is given by X"- X” = X"*", The element X° 
is the identity of G, and it is denoted by e. Note that any two such semigroups are 
isomorphic. Indeed, G is isomorphic to the semigroup N (_){0} with usual addition. 
The symbol X is called an indeterminate or transcendental element. The semigroup 
ring R(G) is given by 


RG) = (a + aX + ax? +61 + aX" | Geek, 


Clearly, this ring is generated by R J {X}. We denoted this ring by R[X] and call it 
the polynomial ring over R in one indeterminate. The elements of R[X] are called 
polynomials in one variable. We note the following. 


(i) dg + ayX + aX? + +) + aX" = 0 if and only if a; = o for alli. 
(li) dg +ayX+---+a,X" = bo t+b\X+---+b,X" if and only ifa; = b; for alli. 
(iil) dg + a,X + +++ + a,X" + bo + bX +--+ + by X™ 


= (ag + bo) + (ay + DY)X +++ + Gn + bp) X" + Dy Xt! Hee + Dy X™, 
where n < m. 
(iv) (4o + aX + +++ + anX") + (bo + 1X +++ + bX") 


= cotcyX +--+ +CnpmX"™, where c; = Lyre = iajde. 


If f(X) = agp +a)X +---+a,X", where a, 4 0, then a,X” is called the 
leading term of the polynomial, a, is called the leading coefficient, and n is called 
the degree of the polynomial. Thus, degree of every nonzero polynomial is defined. 
It can be considered as a map from R[X]* = R[X] — {0} to N U{0}. The degree of 
a nonzero polynomial f(X) is denoted by deg(f(X)). Degree of zero polynomial is 
not defined. 

The proof of the following proposition is straightforward verification. 


Proposition 7.6.3. The polynomial ring R[X] is commutative if and only if R is 
commutative. tt 


The proof of the following proposition is immediate from the definition of addition 
and multiplication of polynomials. 


Proposition 7.6.4 Let f(X) and g(X) be nonzero polynomials such that f(X) + 
g(X) # 0. Then, deg(f (X)+g(X)) < max(deg(f(X)), deg(g(X))). If F(X)-9(X)) F 
0, then deg(f(X) - g(X)) < deg(f(X)) + deg(g(X)). tt 
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In general, strict inequality may hold. For example, take f(X) = 1+X and g(X) = 
2—X in Z[X]. Then, deg(fX)) = 1 = deg(g(X)), whereas deg(f(X)+ g(X)) = 0. 
Also, consider the nonzero polynomials f(X) = 1 + 3X and g(X) = 1 + 2X 
in Ze[X]. Then, f(X)- g(X) = 1 + 5X and so deg(f (X) - g(X)) = 1 whereas 
deg(f (X)) + deg(g(X)) = 2. However, we have the following: 


Proposition 7.6.5 If R is an integral domain, then deg(f (X) - g(X)) = deg(f (X)) + 
deg(g(X)). 


Proof Yf the leading coefficient of f(X) is a, A O and that of g(X) = by, € 0, 
then the leading term of f(X) - g(X) is ayb»,X"*", and since R is an integral domain 
dn» bm # 0. Thus, deg(f(X) - g(X)) = deg(f(X)) + deg(g(X)). t 


Proposition 7.6.6 R[X] is an integral domain if and only if R is an integral domain. 


Proof If R is an integral domain, then the proof of the above proposition says that 
R[X] is also an integral domain. Conversely, if R[X] is an integral domain, then R 
being a subring of R[X] is an integral domain. tt 


Proposition 7.6.7 Let R be an integral domain with identity. Then, units of R[X | are 
those of R. Thus, U(R[X]) = U(R). 


Proof Since R is a subring of R[X] and the identity of R is that of R[X], units of R 
are also units of R[X]. Let f(X) be a unit of R[X]. Then, there exists g(X) € R[X] 
such that f(X) - g(X) = 1. Comparing the degrees of both the sides, we see that 
deg(f (X)) = 0 = deg(g(X)). Hence f (X) and g(X) both belong to R. This shows 
that units of R[X] are also units of R. tt 


Corollary 7.6.8 R[X] can never be a field. 
Proof X #4 0, and it cannot be a unit of R[X]. tt 


Remark 7.6.9 Tf R is not an integral domain, then there may be units of RLX] which 
are not in R. For example in Z4[X], (1+ 2X)-(. + 2X) = 1. Thus, 1 + 2X isa 
unit in Z4[X] which is not in Z4. 


Theorem 7.6.10 (Division Algorithm) Let R be a commutative integral domain with 
identity. Let f(X), g(X) € R[X]. Suppose that g(X) # 0, and the leading coefficient 
of g(X) is a unit. Then, there exists unique pair (q(X), r(X)) in R[X] x R[X] such 
that 

F(X) = g(X)g(X) + r(X), 


where r(X) = Oorelse deg(r(X)) < deg(g(X)). 


Proof If deg(g(X)) = 0, then g(X) = a € R. Since the leading coefficient of g(X) 
is assumed to be a unit, g(X) = aisaunit of R. But then 


F(X) = a f(X)-gX) + 0, 
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and there is nothing to do. Next, suppose that deg(g(X)) > 0.Iff(X) = 0, then 
F(X) = 0-g(X) + 0, 


and again, there is nothing to do. Suppose that f(X) 4 0. The proof in this case is 
by the induction on the degree of f(X). If deg(f(X)) = 0 < deg(g(X)), then 


SX) = 0-g(X) + F(X), 
where deg(f(X)) < deg(g(X)). Thus, the result is true whenever deg(f(X)) = 0. 
Assume that the result is true for all those polynomials whose degree is less than the 
degree of f(X). Then, we have to prove it for f(X). If deg(f(X)) < deg(g(X)), then 
again 

f(X) = 0- G(X) + F(X), 
and so there is nothing to do. Suppose that deg(f(X)) > deg(g(X)). Let 

f(X) = ao + aX + --- aX’, 


where a, 4 0, and 
G(X) = bo + bX +--+ BX", 


where b, is a unit and r > s. Consider the polynomial 
fiX) = fF) — a,by'X'*- g(X). 


If fi(X) = 0, then there is nothing to do. If not, deg(fi(X)) < deg(f(X)). By the 
induction assumption there exist q,(X) and r(X) such that 


AQ) = gX)gX) + r(X), 
where r(X) = Oor else deg(r(X)) < deg(g(X)). But then 


fQ&®) = fi) + a,by'X"g(X) = 
(qi(X) + a,-by'X'™)g(X) + r(X) = g(X)g(X) + r(X), 


where r(X) = Oor else deg(r(X)) < deg(g(X)). 
Finally, we prove the uniqueness of the pair (g(X), r(X)). Suppose that 


FX) = aX)gX) + n(X) 


and 
f®) = eXMgX) + r2(X), 
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where 7;(X) = 0 or else deg(7\(X)) < deg(g(X)) and m(X) = 0 or else 
deg(r2(X)) < deg(g(X)). Then, 


(qi(X) — q2(X))g(X) = (r2(X) — ri(X)). 


Suppose that 7)(X) # 1ro(X). Then, (r2(X) — 11(X)) 4 0. Since R is an integral 
domain, comparing the degrees, 


deg(qi(X) — q2(X)) + deg(g(X)) = deg(r2(X) — ri(X)). 
Since 
deg(rz(X) — r(X)) < max(deg(r2(X)), deg(—r(X))) < deg(g(X)), 


we atrive at a contradiction. Thus, 7;)(X) = 12(X), and since g(X) £0, qi(X) = 
q2(X). f 


The polynomial g(X) in the above theorem is called the quotient and r(X) is 
called the remainder when f (X) is divided by g(X). 
Since every nonzero element of a field is a unit, we have the following. 


Corollary 7.6.11 Let F be a field. Let f (X) and g(X) be polynomials in F[X] such 
that g(X) & 0. Then, there exists a unique pair (q(X), g(X)) in F[X] x F[X] such 
that 

F(X) = q(X)g(X) + r(X), 


where r(X) = Oor else deg(f(X)) < deg(g(X)). tt 


The proof of the above theorem gives an algorithm to find the quotient and the 
remainder. We illustrate it by means of the following example. 


Example 7.6.12 Let f(X) = 2+ 5X + 8X? + 4X? and g(X) = 2+ 3X + X’ be 
two polynomials in Z[X]. The leading coefficient of g(X) is 1 which is a unit. Now, 
fi(X) = f(®%) — 4Xg(X) = 2 — 3X — 4X?. Further, f, (X) + 4g(X) = 10 + 9X. 
Hence 

SX) = 4-4xX)g(X) + 0 + 9X). 


Thus, the quotient is 4 — 4X and the remainder is 10 + 9X. 


Remark 7.6.13 The assumption that the leading coefficient of g(X) is a unit is es- 
sential. For example, consider f(X) = 1 + 2X + 3X? and g(X) = 3 + 6X in 
Z[X |. Suppose we have pairs (q(X), r(X)) in Z[X] x Z[X] such that 


SX) = q(X)g(X) + r(X), 


where r(X) = Oorelse deg(r(X)) < deg(g(X)). But then g(X) = a + bX for 
some a,b € Zand r(X) = c € Z. This gives 6b = 3 which is impossible. 
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Let f(X) = ao + aX + --- a,X" be a polynomial in R[X] and a € R. Then, 


f(a) = ao + aaa + aa? + --- aa" 

is called the specialization or evaluation of f(X) at a. For each a € R, we have the 
evaluation map e, from R[X] to R defined by e,(f(X)) = f(a). The map e, is a 
ring homomorphism from R[X] to R (verify) which is surjective. 


Corollary 7.6.14 (Remainder Theorem) Let R be a commutative integral domain 
with identity. Let f(X) € R[X] and a € R. Then, there exists unique q(X) € R[X] 
such that 


f(X) = q(X):-(X—-a) + f(a) 
Proof By the division algorithm, there exist q(X), r(X) € R[X] such that 
F(X) = q(X)(X —a) + r(X), 


where r(X) = O or else deg(r(X)) < 1. But thenr(X) = re R. Specializing 
the equation at a, we get f(a) = q(a)-(a-—a) + r = r. Thus, f(X) = 
q(X)(X — a) + f(a). f 


Corollary 7.6.15 ker e, of the evaluation map é, is R[X|-(X—a) = {f(X)(X—a) | 
F(X) € R[X]}. t 


We say that a polynomial g(X) € R[X] divides a polynomial f (X) if there exists 
q(X) € R[X] such that f(X) = q(X)g(X). An element a € R is said to be a 
root or a zero of f(X) if f(a) = 0. The following corollary is immediate from the 
remainder theorem. 


Corollary 7.6.16 (Factor Theorem) Let R be a commutative integral domain with 
identity and f (X) € R[X]. Then, X — a divides f (X) if and only if a is a root of f (X). 
t 


Corollary 7.6.17 let R be a commutative integral domain with identity and f (X) € 
R[X]. Let a), a2, ..., a, be distinct roots of f (X). The (X — a,)(X — az) +++ (X — a,) 
divides f (X). 


Proof The proof is by induction onr.Ifr = 1, then it reduces to the above corollary. 
Suppose that the result is true for r. Let a), az, ..., G, 4-41 be distinct roots of f (X). 
Then, by the induction hypothesis, (X — a,)(X — a)---(X — a,) divides f(X). 
Suppose that 

F(X) = q(X)(X — a1)(X — az) +++ (KX —a,). 


Evaluating at a,1,;, we gettO = f(a@41) = G(Qr41)(Qr41 — G1) +++ (G41 — G). 
Since a4; 4 aj for alli < r and R is an integral domain, g(a,41) = 0. Again by 
the above corollary X — a,+, divides q(X). Hence, (X — a,)(X — az) +++ (X — ay41) 
divides f (X). tt 
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Corollary 7.6.18 Let R be as above and f (X) a polynomial of degree n. Then, f (X) 
can have at most n distinct roots. 


Proof If f(X) has r distinct roots a1, a2, ..., a;, then from the above corollary 
f(X) = q(X)(X — ay)(X — ay) +++ (X —a,). 
Since R is an integral domain, comparing the degrees, we get that deg(f(X)) > r. tf 


Remark 7.6.19 If R is not an integral domain, then a polynomial of degree n may 
have more than n distinct roots: For example, 2 + 2X in Z¢[X] has 2 and 5 both as 
roots. 


Theorem 7.6.20 Every finite subgroup of the multiplicative group F* of a field F is 
cyclic. 


Proof Let G be a subgroup of F* of order n. Then, for each divisor d of n, the 
equation X‘ — 1 = O has at most d solutions(follows from the above corollary). 
From the illustration 1.4 of Chap. 5, G is cyclic. tt 


Remark 7.6.21 In fact, every finitely generated subgroup of the multiplicative group 
of a field is cyclic. 


Corollary 7.6.22 The multiplicative group of a finite field is cyclic. tt 


In particular, 


Corollary 7.6.23 The group U, = Z> of prime residue classes modulo a prime p 
is cyclic of order p — 1. t 


How to find a generator of U,,? This problem will be addressed later. One may ask 
another natural question: What are m for which U,,, is cyclic? Indeed, U,,, is cyclic if 
and only ifm = 2, 4, p”, or 2p", where p is an odd prime. The proof of this fact 
will also follow in Algebra 2. 


Remark 7.6.24 Further arithmetical properties of polynomial rings will be discussed 
in Chap. 11. 


Exercises 


7.6.1 Let R be a commutative ring with identity and G a semigroup with identity 
e. p and w the embedding of R and G, respectively(observe that (1) = w(e)). 
Let R’ be a ring with identity together with a ring homomorphism 77 from R to R’ 
and an identity preserving semigroup homomorphism p from (G, -) to (R’, -) such 
that 711) = p(e) = I’ the identity of R’. Show that there exists a unique ring 
homomorphism py from R(G) to R’ such that wod = nand wow = p. 


7.6.2 Let G be acyclic group. Show that R(G) is homomorphic image of R(Z). 
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7.6.3 Find the number of elements in Z,(G), where | G | =n. Is it an integral 
domain? 


7.6.4 Describe the group ring Z(G), where G is a cyclic group generated by g. 
Describe its group of units. 


7.6.4 Find the group of units of Z(G), where p is a prime and G is a cyclic group 
of order p. 


7.6.5 Let G be a finite group and R a commutative integral domain with identity. 
Show that e + g is a zero divisor for all g € G. 


7.6.6 Show that the group ring R(G) described in Example 7.6.1 is isomorphic to 
the subring of M@(IR) consisting of the matrices of the type 


ab 
ba|- 
Hence, describe its group of units as a matrix group. 


7.6.7 Describe the group ring R(G), where G is a cyclic group of order 3. What is 
the group of units? 


7.6.8 Describe the group ring Z(G), where G is a cyclic group of order 3. What is 
the group of units? 


7.6.9 Describe the group ring IR(G), where G is infinite cyclic group. What is the 
group of units? 


7.6.10 Let G be a group. Show that the map ¢« from Z(G) to Z defined by 
€(LygceG4g9) = LgeGaq is a ring homomorphism. This map is called the aug- 
mentation map. The kernel of this map is called the augmentation ideal. Show that 
the augmentation ideal is generated by the set {fe — g | g € G}. 


7.6.11 Let R be a commutative ring with identity and G a semigroup with identity. 
Show that the ring M,,(R(G)) of n x n matrices with entries in the semigroup ring 
R(G) is tautologically isomorphic to the semigroup ring M,,(R)(G). 


7.6.12 Let G be a semigroup with identity such that for any x € G, there are only 
finitely many pairs y,z € G such that yz = x (e.g., NU){0}). Let R((G)) denote 
the set of all maps from G to R. Show that R((G)) is a ring with respect to the 
point wise addition and the product defined by 


f-g@) = Xy.z=xf () 9(Z), 


and which contains R(G) as a subring. 
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7.6.13 Describe the centers of the group rings C(S3) and C(Qsg). 


7.6.14 Let R be a commutative ring with identity, and R’ a ring. Let a € R’, and 7 
a homomorphism from R to R’ such that a commutes with each element of 7(R). 
Show that 77 has unique extension p to R[X] subject to the condition p(X) = a. 


7.6.15 Let f(X) = 2 + 5X + 6X? + 8X*, and g(X) = 3 + SX + 5X?. 
Find f(X) + g(X) and f(X) - g(X). Show that f(X) cannot be expressed as f(X) = 
q(X)g(X) + r(X), where r(X) = Oorelse deg(r(X)) < deg(g(X)). 


7.6.16 Find the remainder when 1 + 6X + 8x? + 5X? is divided by X + 2. 
7.6.17 Show that X? — 2X + 2has no root inQ. 

7.6.18 Show that X2 + X + 1 has no root in Zp. 

7.6.19 Show that X?2 + T has no root in Z7. 


7.6.20 Show that the polynomial ring Z,[X] is an infinite integral domain of char- 
acteristic p #4 0. Give an example of an infinite field of characteristic p £ 0. 


7.6.21 Show that f(X) —f(q) is divisible by (X — a) for all polynomial f (X). Deduce 
that (X — a) divides X”" — a" for alln > 1. 


7.6.22 Suppose that F is an infinite field. Suppose that f(a) = 0 for alla e F. 
Show that f(X) = 0. 


7.6.23 Show that the ideal of R[X] generated by X? + 1, where R is the field of 
real numbers, is a maximal ideal. 

Hint. If A is an ideal containing R[X] - (X 24-1) properly, then there exists f(X) € A 
such that X7+ 1 does not divide f (X). Apply division algorithm to show thatX+a € A 
for some a € R. Again, apply division algorithm to show that a? + 1 € A. 


7.6.24 Show that R[X]/(R[X](X*+1)) isa field isomorphic to the field C of complex 
numbers. 

Hint. Consider the map f(X) + f(i) from R[X] to the field C. Show that it is 
surjective homomorphism. Find the kernel. 


7.6.25 Show that Z7[X]/(Z7[X](X? + 1)) is a field of order 49. 
7.6.26 Show that Zo[X]/(Z2[X](X* + X + 1)) is a field of order 4. 


7.6.27 Define a map D from R[X] to R[X] as follows: Let f(X) = ap +a,X+---+ 
a,X" € R[X]. Define 


D(f (X)) = ay + 2aaX + 3a3X? + --+ na,X"!. 
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Show that D is a derivation in the sense that 


(i) Daf(X) + bg(X)) = aD(f(X)) + bD(g(X)), 
and 
(ii) D(X) -gX) = DF(X))- GX) + f(X)- D(g(X)). 


7.6.28 Define D” inductively as follows: 

Define D°(f(X)) = f(X), D'(f(X)) = D(f(X)). Supposing that D’(f(X)) has 
already been defined, define pit! (f(X)) = D(D'(f (X))). Let f (X) be a polynomial 
in R[X] of degree n. Show that for each r < n, there exists a unique polynomial g(X) 
in R[X] such that r!g(X) = D’(f(X)). We denote the polynomial g(X) by 77), 


7.6.29 (Taylor’s Formula). Let f(X) € R[X] be a polynomial of degree n. Show that 
D? D" 
flat+h) = f(a) + hDG¢@) + pot) ge pr PX E@ 


n\ 
foralla,h eR. 


7.6.30 Calla polynomial f (X) of positive degree irreducible if it cannot be expressed 
as product of polynomials of lower and positive degrees. Let f(X) € F[X] be an 
irreducible polynomial of positive degree over a field F. Let F’ be a field containing 
F asa subfield. Let a € F’, and f(X) € F[X] be a polynomial of least degree of 
which a is a root. Show that f(X) is an irreducible polynomial in F[X]. 


7.6.31 Let F’ be a field containing F as a subfield. Let a € F’ and f(X) € F[X]. An 
element a € F’ is said to be a multiple root of f(X) in F’ if (X — a)* divides f(X) 
in F’[X]. Show that @ is a multiple root of f(X) if and only if a is also a root of the 
derivative D(f(X)) of f(X). 


7.6.32 Suppose that F is a finite field, or it is a field of characteristic 0. Let F’ be 
a field containing F as a subfield which contains all roots of a nonzero irreducible 
polynomial f (X) in F[X](we shall see in Algebra 2 in the chapter on Galois theory 
that such a field always exists). Use the above exercise to show that there are exactly 
n distinct roots of f(X) in F’, where n is the degree of f (X). 


7.6.33 Is the above result true for infinite fields of characteristic p 4 0? Support. 


7.7 Polynomial Ring in Several Variable 


Let W denote the set of all nonnegative integers, and G = W’. Then, G isa 
semigroup with respect to coordinate-wise addition. The identity being (0.0.--- , 0). 
Let R be a ring with identity. Consider the semigroup ring R(G) and the embeddings 
¢ of R and w of G as described in the beginning of the previous section. Let us denote 
w((, 0,0, ...,0)) by X1, WC, 1, 0,...,0)) by X2, ..., Y((0, 0, ..., 0, 1)) by X,. 
Identify ¢(a) by a. Then, every element of R(G) is uniquely expressible as 
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ee i EW? Ciyini, Xy Xy + XP. 
The ring R(G) is generated by RU {X1, X2,...,X,}, and it is denoted by 
R[X|, X2,..., X;]. This ring is called the polynomial ring in r variables. {X,, 
Xo,...,X,} are called the indeterminates of the polynomial ring. An element of 
R(X), X2, ..., X,] is usually denoted by f(X1, X2,..., X-). The nonzero expressions 
Qi ip--i,X{'X5 +++ X!” appearing in the polynomial f(X1, X2,...,X,) are called the 
monomials in the polynomial f(X,, X2,..., X,). The sum i; +i: +--- +i, is called 
the degree of the monomial. The degree of a maximum degree monomial appearing 
in a nonzero polynomial is called the degree of the polynomial. As in case of one 
variable polynomial, R is an integral domain if and only if R[X), Xo,..., X;] is an 
integral domain, and then, the degree of the product of two polynomial is sum of 
their degrees. 

Every nonzero element (X1, X2) € R[X,, X2]is uniquely expressible as amember 


ag(X1) + a(X1)X2 + an(Xi)XF + +++ + a,(X1)XF 


of R[X, ][X2]. This identification of elements of R[X,, X2] as elements of RLX,][X2] 
also respects the operations in the corresponding rings. As such, the ring R[X, X2] 
is same as R[X ][X2]. More generally, by induction, R[X1, X2,..., X41] = RIM, 
X,..-, XX]. 

Let R be a commutative ring with identity. Let Map(R’, R) denote the set of 
all functions from R” to R. Clearly, Map(R’, R) is a ring with respect to pointwise 
addition and multiplication. Define a map go from R[X,, X2,..., X,] toMap(R’, R) by 


(F(X, Xo, eee | X,)) (ar, a2, ila | Qy)) = f(a, 2, clic ay). 


Since R is commutative, go is a ring homomorphism. The elements of the image of 
g are called the polynomial functions. They are also called the regular functions 
on R’. In general, a function need not be a polynomial function. For example, the 
function f from R to R defined by f(t) = sint is not a polynomial function. 
Fix an element @ = (aq), Q2,...,a@,) € R’. Define eg from R[X), Xo, ..., X;] to 
R by 
ex(f (X1, X2,...,X-)) = flair, a2,..., ay). 


Again, since R is commutative, eg is a surjective ring homomorphism. This homo- 
morphism is called the evaluation map. It is also called the specialization map 
at a. 

Let F be a field. The set F’”” is called an affine m-space. This set is denoted by 
A” (F). Let f(X1, Xo, ..., Xm) be a polynomial in m variable. An element @ of A” (F’) 
is called a zero of f(X1, X2,...,Xm) if f(ai, a2,...,Qm) = O, or equivalently, 
f(Q&%, X2,...,Xm) € ker eg. The set of all zeros of f(X1, X2,..., Xm) is denoted by 
V(f), and it is called a hypersurface of the affine m- space A"(F) = F”. 


7.7 Polynomial Ring in Several Variable 263 


Example 7.7.1 The circle S' = {(x,y) € R? | x? +y* = 1} is a hypersurface 
V (f) of A7(IR), where f(X, Y) = X* + Y* — 1. If we treat f(X, Y) as polynomial 
in R[X, Y, Z], then V(f) is a right circular cylinder which is a hypersurface in R>. 


If F is a field, then F[X,, X2,..., X] is an integral domain. Its field of quotients 
is denoted by F(X), X2,..., Xm). The members of F(X, X2, ..., Xm) are of the form 
[oe where g(X1, X2,..., Xm) 4 0. In short, this element is denoted by A 
Proposition 7.7.2 Let F be an infinite field, and f (X\, Xz, ..., Xm) be a polynomial 
in m variable. Then, V(f) = A™(F) if and only if f (X,, X2,...Xm) = 0. 


Proof Vff(X1, X2,...Xm) = 0, thenclearly V(f) = A” (F). We prove the converse 
by the induction on m. If f(X) is a nonzero polynomial in FX], then the set V (f) of 
zeros of f (X) is a finite. Since F is infinite, V(f) 4 A! (F).This proves the result for 
m = 1. Assume the result for m. Let f(X1, X2,..., Xm+1) be a nonzero polynomial 
in m-+ 1 indeterminates. Then, 


f(X1, X2,.-.,Xm41) = 

ago (X1, siiaiia Xm) a ay (X1, sisi Xm)Xm41 a eae a,(X}, ee Xm)Xna ps 

where a,(X1,..., Xm) iS a nonzero polynomial in m indeterminates and r > 1. 
Thus, this polynomial is a nonzero polynomial in one variable over the field K, 
where K = F(X, X2,..., Xm). Since it has only finitely many zeros in K, there 
is an element a,,,; € F such that f(X,, Xo,..., Xm, m+1) # 0. The polynomial 
g(X1, X2,...,Xm) = f(X1, X2,...,Xm; Am+1) 1S a nonzero polynomial in m in- 
determinates. By the induction hypothesis, there are elements a}, Q2,..., Qj, in F 
such that g(a}, a2,..., @m) # 0. This means that (a1, a2,..., @mzi1) € V(f). tt 


Corollary 7.7.3. Let F be an infinite field. Then, the map 9 from F(X, X2,..., X;] 
to Map(R’, R) defined by 


(F(X, Xo, see X,)) (a, Q2,-++5 a,)) = f(a, O2,.-. a;) 
is an injective map. 
Proof From the above proposition, it follows that the kergfo = {0}. tt 


Proposition 7.7.4 Let F be an algebraically closed field in the sense that every 
polynomial over F has a root in F (e.g., C). Then, V(f) 4 Y for every polynomial f 
in F[X,, Xo, ..., Xml. 


Proof We prove the result by the induction on m. If m = _ 1, then the result fol- 
lows from the assumption that F is algebraically closed field. Assume the result for 
m. Let f (X1, X2,..., Xm+1) be a polynomial in F[X), X2,..., Xn4i]. If it is a zero 
polynomial, then there is nothing to do. Suppose that it is a nonzero polynomial. 
Then, 
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F(X, Xo, tae »Xm+i) = 
ag(X1, a) Xm) oF ay(X, tee Xin)Xm+1 at eae ay(X1, see Xm)Xnv i 


where a,(X1,..., Xin) is a nonzero polynomial in m indeterminates and r > 1. By 
the induction hypothesis, we have aj, a2,..., @m» such that a,(a1,...,Qm) = 0. 
Consider the polynomial g(Xin41) = fai, Q2,.--, Am, Xm41) in F[Xn+1]. Since F 
is algebraically closed field, there is a,,; € F such that (a), Q2,...,Qm, Qm+1) € 


V(f). t 


Proposition 7.7.5 Let F, be a finite field of orderqg = p", n = | (in fact, it 
will follow (see Algebra 2, Chap.9) that there is one and only one such field up to 
isomorphism). Let f (X,, X2,...,Xm) be a nonzero polynomial over Fg such that 
the degree of f in each X; is less than q. Then, V(f) 4 F"",i.e, there are elements 
Q1,Q2,..., Am in Fy such that f (ay, a2,..., Am) # 0. 


Proof The proof is by the induction on m. If f(X;) is a polynomial over F in 
one indeterminate, then since degf(X,) < q, it has at the most g roots. Hence, 
there is an a € Fy such that f(a~) ¢ 0. Assume that the result is true for m. Let 
Sf (X1, X2,..., Xm41) be a nonzero polynomial over F, such that the degree of f in 
each X; is less than q. Then, 


F(X, Xo, tae Xm+i) = 
ag (X1, ..-, Xm) + a1 (X1,..-,Xn)Xmg1 +++ +a (XM, .-- Xm)Xnap 


where a,(X1, ..., Xm) is a nonzero polynomial in m indeterminates, | < r < g, and 
the degree of a,(X,,..., Xi) in each x; is less than q. By the induction hypothesis, 
there are elements a1, 42, ..., Q in Fy, such that a,(Q1, Q2,..., @m) A 0. In turn, 
gGXm+i) = far, @2,.--, Am, Xm41) iS a polynomial in one indeterminate, and its 
degree is less than g. Hence, there is an element a,,,; € F such that g(Qmn41) = 
f(a, O12, +--+, Am, Om+1) #0. tt 


Corollary 7.7.6 The map from F4[X,, Xz, ...Xm] to Map(F7, F,,) is a surjective 
map,i.e., every map from F7' to F is determined by a polynomial. Indeed, it is 
determined by a unique polynomial f (X,, X2, ..., Xm) having the property that the 
degree of f (X,, X2,..., Xm) in each X; is less than q. 


Proof Consider the subset 
A = {f(X, X2,...,Xm) | degree of f in X; is less than q for each i}. 


It follows from the above proposition that s restricted to A is an injective map. 
Clearly, the number of elements in A is g@ which is the same as the number of 
elements in the set Map(F'”, F,). Thus, g is a bijective map from A to Map(F”, F,). 
In turn, it is a surjective map from Fy[X,, X2,...Xm] to Map(F 7, F,). 


Proposition 7.7.7 Let f(X, X2,...,Xm) € Fa[X1, X2,...,Xm]. Then, there exist 
polynomials qj(X1, X2,...,Xm) € Fa[X1, X2,..-,Xm], i = 1,2,...,m, anda 
polynomial r(X,, Xo, ..., Xm) such that 
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St (X11, X2,..., Xm) — 
> aan 1gi(X1, Xo, cee Xm) (X — Xj) ag r(X1, Xo, cee Xm); 


where r(X1, X2,..., Xm) is zero, or else degree of r(X\, X2,..., Xm) in each X; is 
less than q. 


Proof Form = 1, this is just the division algorithm theorem. Suppose that m > 1. Let 
F(X, X2,..., Xm) € Fg[X1, X2,..-, Xm). Since X/, — X,, is anonzero polynomial in 
Fy[X1, X2, ..-, Xm—1][Xm], by the division algorithm theorem, there are polynomials 
Qm(X1, X2,.-., Xm) and ry (X1, Xo, ..-, Xm) in Fg[X1, Xz, .-.,Xm—1][Xm] such that 


f(%, X, ae Xm) S 
Qm(X1, X2, tees Xm)(Xn => Xm) + rn (X1, X2, tee Xm), 


where 7,(X1, X2,...,Xm) is 0, or else degree of %,(X1, X2,...,Xm) in Xp is 
less than q. If r,(X1, X2,..., Xm) is 0, then there is nothing to do. Suppose that 
In (X1, X2, ..., Xm) 4 0. Then, as before, treating it as an element of F,[X), X2,..., 
Xm—-2,Xm][Xm-1], we get polynomials qn—1(X1, X2,..-,Xm) and rm—1(X1, X2, 
...,Xm) such that 


Tin (X1, X2,-.-,Xm) = 
Am—1(X1, X2, tee Xm Xe 4 ~— Xm—1) + rm—1(X1, X2, tee Xm), 


where rm—1(X1, X2, ..., Xm) is 0, or else degree of rm—1(X1, X2,..., Xm) in Xm—-1 is 
less than g. Suppose that 7,,_;(X1, X2,..., Xm) in Xm_1 4 O. Since the degree of 
1m(X1, X2,..., Xm) in Xp is less than g, the degree of 1,-1(X1, X2,..., Xm) in Xm is 
less than g. Proceeding inductively, we arrive at the desired result. tt 


The following corollary is immediate from the above proposition. 


Corollary 7.7.8 The kernel A of the map 9 is the ideal generated by the set 


{Xf —X1, XJ —X2,...,X4 — Xm}. tt 


Corollary 7.7.9 F,[X,, X2,..., Xm]/A is isomorphic to the ring Map(F7, Fy). ft 


Theorem 7.7.10 (Artin—Chevalley) Suppose that f(X,,X2,...,Xm) € FqlX, 
X2,...,Xm] is a polynomial of degree less than m such that f(0,0,...,0) = 0. 
Then, V(f) 4 {(0,0,..., 0)}, ie, there exist a1, Q2,..., Am not all zero such that 
faa, Q2,--+,Am) = 0. 


Proof Suppose that V(f) = {(0,0,...,0)}. Consider the polynomial @(X1, X2, 
2.2, Xm) = 1 — f(Xy, Xo,...,Xm)I!. Since f(0, 0,...,0) = 0, (0,0,...,0) ¢ 
V(@). If (a1, Q2,..-, Am) 4 (0,0,..., 0), then by the supposition, f(a;, a2,..., 
Qm) ~ 0. But then (a1, a2,...,Qn) € V(@). This shows that V(¢) = a _ 
{(0, 0, ..., 0)}. Consider the polynomial 
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1 


PO 26... Sa a ae), 


Clearly, V(¢) = V(P). Consequently, ¢ — P € kergo. Since the kerg is the ideal 
generated by the set 
{X7 — X,, XJ —X,..., X4 


m 


— Xn}, 


and the degree of P is m(q — 1), it follows that the deg(¢) => m(q — 1). Since 
the degree of f is assumed to be less than m, we arrive at a contradiction. Hence, 


Vf) A {(0, 0, ..., OD}. tt 


Theorem 7.7.11 (Warning’s Theorem) Let f(X), X2,...,Xm) be a member of 
F,[X1, X2, ..., Xml], and the degree of f is less than m. Then, | V(f) | is divisible by 
the characteristic p of F4. 


Before proving the Warning’s theorem, we establish the following lemma. 


Lemma 7.7.12 Let f be a map from N \){0} to F, defined by f(r) = Xxer,x" (by 
convention x®° = 1). Then, f(r) = —lifr > Oandq—\ divides r. Further, 


f(r) = O, otherwise. 


Proof Clearly, f(0) = q1 = 0.Suppose that r € N. If g—1 divides r, thenx” = 1 
for all x # 0, and, of course, 0" = 0. Hence, f(r) = (q-—1)1 = —1. Next, 
suppose that g — | does not divide r. Then, there is a nonzero element a € F, such 
thata” #1. Now, a’f(r) = a Xyerx” = Ler, (ax)! = f(r). Since a’ # 1, 
f = 0. tt 


Proof of the Warning’s Theorem. 


Let f(X1, X2,...,Xm) € Fg[X1, X2,..., Xm] be a polynomial whose degree is less 
than m. Consider the polynomial g(X), X2,...,Xm) = 1 -— f®%, 
Xo,...,Xm)e1. Clearly, g(a1, Q2,..-,Q@m) = lif and only if (a1, a2,...,Qm) € 
V(f). Further, g(a1, Q2,..-,Qm) = Oif (aj, Q2,..., Am) ¢ V(f). Therefore, it is 
sufficient to show that 


Dilai-00 ai Omer” (O41, OQ, 00g Qn) = 0. 


Next, observe that deg(g) < m(q—1). Hence, the degree of each monomial appearing 
in g is of degree less than m(q — 1), and so the degree of some X; appearing in each 
monomial of g is less than g — 1. Let aj,i,...;,, X{|X> -- - Xi" be a monomial appearing 


In 


in g(X1, X2,..., Xm). We may assume that i, < q — 1. From the above lemma, for 
fixed members Qj, Q2,..., Qm_— 1 Of Fy, 
iia im-1 im __ 
og Fy Git ip: -+in OY Oy Ay {Xm = 0. 
Hence, 


im-1 


im 
pene m1%m = 0. 
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In turn, it follows that 
Lay ,a0 re Om )eFn J (Q4] 52, +++, Qm) = 0. 


tt 


Corollary 7.7.13 Let f,, fo, ...f- be polynomials over Fy in m variables. Suppose 
thatm > %X! _ ,deg(f;), and each f; is without constant term (i.e., f;(0) = O for alli). 
Then, 

VA VAL): (VG) 4{@,0,...,0}. 


Proof Take f = fif2---f-, and apply the Warning’s theorem. ft 
Exercises 


7.7.1 Let F bea field, A”(F) the affine n- set, and F[X), X2, ...X,] the polynomial 
ring inv variables. Let A be a subset of F[X,, X2, ...X,], and V (A) denotes the subset 
Area V (f). This defines a map V from the power set of F[X,, Xz, .. .X,] to the power 
set of A"(F) = F". The sets of the form V(A) are called affine algebraic subset 
of the affine n-set. Show that 


G@) V({O}) = A*(F). 
Gi) V({l}) = 9B. 
ii) V(A) = V(<A>) = V(V<A >). Thus, V is not injective. 
(iv) Show that arbitrary intersection of affine algebraic sets are affine algebraic. 
(v) Finite union of affine algebraic sets are affine algebraic. 
(vi) The family T of compliments of affine algebraic sets form a topology on A" (F). 
This topology is called the Zariski topology on A” (F). 


7.7.2 Show that {(cos t, sin t, t) | t € R}, where R is the field of real numbers, is 
not an affine algebraic set. Thus, V need not be a surjective map. 


7.7.3 Let Y bea subset of the affinen-setA"(F). LetI(Y) = {f € F[X,, Xo,...Xn] | 
Y C V(f)}. Show that 7(Y) is a radical ideal such that Y C V(I(Y)). 


7.7.4 Show that VU(V(A))) = V(A). 
7.7.5, Show that V and / are inclusion reversing maps. 


7.7.6 Let Y be a subset of the affine m-set A"(F). The ring F(Y) = F[X, Xo, 
..;Xm]/I(Y) is called the coordinate ring of Y. Show that the coordinate ring 
I'(Y) is a reduced ring. 


7.7.7 Consider the line L = V(f) in R?, where f(X,Y) = Y-—mX+c,m40. 
Show that I"(L) is isomorphic to the polynomial ring R[t] in one indeterminate. 


7.7.8 Consider the parabola P = V(Y — X*) in R?. Show that '(P) is isomorphic 
to (LZ). Observe that P and L are not congruent. 


268 7 Elementary Theory of Rings and Fields 


7.7.9 Consider the circle S' = V(X? + Y? — 1) in R’. Show that P(S') is not 
isomorphic to (ZL). What is '(S 9 


7.7.10 Identify the coordinate ring PV (Y), where Y = {X? + ¥?+Z?-1,X—Y}. 
7.7.11 Can the ring M2(R) be coordinate ring of algebraic subset? Support. 


7.7.12 Is T(Y) always an integral domain? If not, under what condition it can be an 
integral domain? 


Chapter 8 
Number Theory 2 


Chap.3 was devoted to some elementary results in arithmetic such as division 
algorithm, Euclidean algorithm, fundamental theorem of arithmetic, and solutions of 
linear Diophantine equations and of linear congruences. Further, in Chaps. 4, 5, and 
7, we proved some fundamental results such as Wilson theorem, Euler-Fermat’s the- 
orem as applications of group theory and ring theory. In this chapter, we study arith- 
metic functions, Quadratic residues, Quadratic reciprocity law, nth power residues, 
and nonlinear Diophantine equations. 


8.1 Arithmetic Functions 


Definition 8.1.1 A map f from the set N of natural numbers to the field C of complex 
numbers is called an arithmetic function. 


Let A(N, C) denote the set of all arithmetic functions on N. Then, A(N, C) is 
an abelian group with respect to the pointwise addition. Define a multiplication * in 
A(N, C) by 

(f*g)() = Lad =nf (di)g (dr). 


This multiplication is called the Dirichlet multiplication. Now, 


((fxg)*xh)(n) = Yaa =nl(f x ghd) = Laas = n(Lad = aha) = 
Ldideds =n Jf (di)g(da)h(a3) = (f * (g*h)(n) forall n. 


Thus, * is associative, and so A(N, C) is a semigroup with respect to x. It is also easy 
to see that * distributes over +, and so A(N, C), +, *) is a ring. Further, 
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(fF * gn) = Lad =nf di)g(dr) = (g* f)(n). 


forall f, g € A(N, C). Thus, A(N, C) is a commutative ring. 
Let e be a map from N to C defined by 


lifn=1 


Evel O otherwise. 


Then, (fxe)(n) = Yaa anf (dije(ao2) = f(n), and similarly, (ex f\(n) = 
f(n) foralln € N. Thus, e is the identity of the ring A(N, C) of arithmetic functions. 

The exponential function n —> e”, the log function n —> log(n), the factor- 
ial function n —~> n!, the function n —> ./n, and the functionn —> 1+ 5 + 7 + 
veep 7 — logn are some examples of arithmetic functions. We shall be more inter- 
ested in functions f such that f(n) reflects some arithmetic properties of n, and not 
only the size of n. Following are some important such examples. 


Example 8.1.2 The Euler’s phi function ® from N to N already defined in Chap. 4 
(@(1) = 1, and forn > 1, ®(n) = the number of positive integers less than n 
and co-prime to 7) is an arithmetic function. 


Example 8.1.3 The divisor function 7 from N to N (7(n) = the number of divisors 
of n) is an arithmetic function. divisor function. 


Example 8.1.4. The function o from N to N defined by 
a(n) = Land, 


is an arithmetic function. This is called the 
sum of divisor function. 


Example 8.1.5 The function o;,, k > 1 defined by 
oy(n) = Lajnd* 
is another arithmetic function. 


Example 8.1.6 (Mobius function) The arithmetic function yz from N to Z defined by 
@ wd) = 1, 


Gi) un) = 0, if a? divides n, for some a > 1, and 
Gii) uw(n) = (—1)’, ifnis product of r distinct primes 
is called the Mobius function. 


Example 8.1.7 The identity map Jy from N to N is another example of an arithmetic 
functions. 
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Example 8.1.8 For k > 1, we define an arithmetic function ®, from N to N as 
follows: ®,(1) = 1,andforn > 1, ©,(7) is the number of k-tuples (a1, a2, ... ax) 
of positive integers such that a; <n forall i and the g.c.d (aj, Q2,...,%,n) = I. 
What is ®,? 


Example 8.1.9 We have another arithmetic function from N to the field R of real 
numbers defined by \(n) = log p, if n is a power of some prime p, and zero, 
otherwise. 


Example 8.1.10 The constant function {| from N to N defined by (7) = 1 is an 
arithmetic function. 


Definition 8.1.11 An arithmetic function f from N to C is called a multiplicative 
function if f is not identically zero, and 


(m,n) = limplies that f(mn) = f(m)f(n). 
It is said to be an absolutely multiplicative function if 


f(mn) = f(m)f(n) forallm,neéN. 


Remark 8.1.12 If f is multiplicative nonzero function, then f(1) = 1, for(m, 1) = 
l,andso f(m) = f(m)f() forallm EN. 


Example 8.1.13 The identity element e of A(N, C) defined earlier is multiplicative. 
In fact, it is absolutely multiplicative(verify). 


Example 8.1.14 The function J defined above is multiplicative. In fact, this is also 
absolutely multiplicative. 


Example 8.1.15 The identity function Jy is also absolutely multiplicative. 


Proposition 8.1.16 The divisor function T is multiplicative, and 
T(py' ps”... pe’) = (a+ 1)(a24+))...(a- + D, 


where Pi, P2,.--, Pr are distinct primes. 


Proof Suppose that (m,n) = 1. It is evident that every divisor d of m -n can be 
uniquely expressed asd = dj - dz, where dj is a divisor of m and dp is a divisor of n. 
Thus, the number of divisors of m - n is the product of the number of divisors of m and 


that of n. This shows that T(@m-n) = T(m)-7T(n). Further, if p is a prime number, 
then 1, p, p*,... p® are precisely the divisors of p®. Hence, T(p®) = a + 1. By 
the multiplicative properties of 7, the result follows. tt 


Proposition 8.1.17 Let f and g be multiplicative functions. Then f x g is also 
multiplicative. 
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Proof Suppose that (m,n) = | anduv = mn. Then,u = d)d2 andv = d3da, 
where d;/m, do/n, d3/m, and d4/n. Clearly, (dj,d2) = 1 = (d3,d4). Also, 
d|dz; = mandd)d4, = n. Since f and g are multiplicative, f(u) = f(d\) f(d), 
and g(v) = g(d3)g(dy). Thus, f(w)g(v) = f(di)g(ds) f (d2)g (ds). Hence, 


(f * g)(mn) S Luv =manf Uu)g(v) = 
Layds = midody = nf (d1)8(d3)) (Ff (do) g (da) = 
(Lada, = mf (d1) 8 (d3)) - (Saga, = nf (d2)g(da)) = (f * g)(m)- (f * g)(n). 


Thus, f * g is multiplicative. tt 


Let M(N, C) denote the set of all multiplicative functions. Following result is the 
restatement of the above proposition. 


Corollary 8.1.18 M(N, C) is a sub-semigroup of (A(N, C), *) which also contains 
the identity e. tt 


Corollary 8.1.19 Let f be a multiplicative function. Define a function F by 
F(n) = Xan J (d). 


Then F is also multiplicative. 


Proof The result follows from the above proposition if we observe that 


= fri. tt 
Proposition 8.1.20 The function o;(in particular 0) is multiplicative function, and 


(ay +Dk (ay+Dk (ap +k 
Dy —1 Px —l1 pe =" 


Qa, a2 eo L . 
OK(P,' Pr’ - )= Pal re ee 
where Pi, P2,.--, Pr ave distinct primes and k > 1. 


Proof By the definition, o,(”) = Dajnd*® = Layntg(d), where t is defined by 
te(n) = n*. Since t is multiplicative(in fact, absolutely multiplicative), the multi- 
plicativity of o; follows from Corollary 7.1.19. Next, 


(atk _ 4 
pe-l 


ox(p°) = Laypod® = + ph + (ph +--+ (p2k = P 
The result follows from the multiplicativity of ox. tt 


Example 8.1.21 The number of divisors of 200 is 7(200) = (3-57) = 
Gt 1)-(2+1) = 12, and the sum of divisors of 200 is 0(200) = o(23-57) = 


2t=1) 53-1 
top sor = 


Proposition 8.1.22 The Mébius function pi is multiplicative. 
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Proof Suppose that (m,n) = 1. Ifm orn is 1, then since w(1) = 1, w@nn) = 
Lu(m) (1). Suppose that m # 1 #n. If a square greater than | divides m or n, then 
it also divides m - n, and so in this case, u(m-n) = 0 = y(m) p(n). Suppose that 
m is product of r distinct primes and n is product of s distinct primes. Then, since m 
and n are co-prime, m - n will be product of r + s distinct primes. By the definition, 
pm -n) = (-D* = (-D (HDS = won) p(n). t 
Proposition 8.1.23 

_flifn=1 

%ajnt(d) = 0 otherwise 


Thus, e(n) = Yajnp(d). 


Proof Let 7 denote the function given by 7(n) = Lg/npi(d). We show that 7 = 
Since 1 is multiplicative, 7) is multiplicative. Ifn = 1, then Xgjjp(d) = wd) = 
Next, fora > 1, 7(p%) = Lapop(d) = 23 ol(p”) = 1-14+0+0+4+-:--+ 
0 = O. Since 77 is multiplicative, the result follows. ft 


| 
= 


Corollary 8.1.24 The function | is the inverse of the Mébius function p in the 
semigroup A(N,C) of arithmetic functions with respect to the Dirichlet product. 
More precisely, [* 4 = e = wr. 


Proof From the above proposition, we have 
e(n) = Lamp(d) = Law =np(dA)V(d') = wx {n). 


Thus, wx fs = e. Similarly, [* wu = e. tt 


Theorem 8.1.25 (M6bius inversion theorem) Let f be an arithmetic function, and 
F the function defined by 
F(n) = Xan J (d). 


Then 
FQ) = Laynpld)F (4). 
Proof 
F(n) = Xan fd) = Yaa =nf(@Ua) = (f *D@). 
Thus, 
F= fel = U«f 
Now, 
pw F = pxeQ«f) = (ureDef =erxf =f. 
Hence, 


f(r) = Yaa = nd) F (d') = Lajnpld)F (4). 
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Corollary 8.1.26 Anarithmetic function f is multiplicative if and only if the function 
F defined by 
F(n) = Xan f (a) 


is multiplicative. 


Proof We have seen that F = fx and f = wx F. Since yu and { are both 
multiplicative, and the Dirichlet product of multiplicative functions is multiplicative, 
the result follows. tt 


Corollary 8.1.27 The Euler’s phi function ® is multiplicative, and 
O(n) =n- Dain 2. 


Proof By Theorem 4.5.37, 
n= LajnP(d) 


or 
Iy(n) = YajnP(d). 


Since Jy is multiplicative, by the above corollary, ® is multiplicative. Further using 
Mobius inversion theorem 


O(n) = Laynw(d)Iy(G) = n> Lan". i 
Proposition 8.1.28 Let f be a multiplicative function. Then 


Land) f(d) = [J (1 — f(p)). 


p/n,p a prime 


Proof Since y and f are both multiplicative and product of multiplicative functions 
is multiplicative, 4.- f is multiplicative. Hence, the function 7 defined by n(n) = 
Xajnt(d) f (d) is multiplicative. Thus, it is sufficient to show that n(p°) = 1 — 
Ff (p), where p is a prime and a > 1. Now, 


n(p%) = D$-ou(p*) f(p®) = vO) fC) + up) f(p) = 1- fp). tt 


Corollary 8.1.29 Dan“ = [inp « primed — 5). 


1 


Proof The map n ~» ;, is a multiplicative function. Apply the above proposition. { 


Corollary 8.1.30 O(n) = n-[] 


1 
p/n,pa pine’! ~~ p> 


id) 1 
Proof O(n) =n- Lajn | =i ‘iene wae ~ p?t t 
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Example 8.1.31 The number of positive integers less than 200 and co-prime to 200 
is 
(200) = (27-5?) = 200-(1—4)-(1—4) = 80. 


The number of positive integers less than 200 and not co-prime to 200 is 200 — 80 = 
120. 


Example 8.1.32 xT = . Equivalently, Lajnu(d)T(5) = 1 for all n. 


Proof tT(n), by definition, is the number of divisors of n. Thus, 7(1) = Ya/nf(d). 
Using Mobius inversion theorem, 


1 = {@) = Lajnp(d)r (4) Ya. t 
Example 8.1.33 juxo = Ivy or equivalently,n = Yajnp(d)o(5) for all n. 
Proof Since a(n) is the sum of divisors of n, 

a(n) = Lajnd = LajnIy(d). 
Using Mébius inversion theorem, 


n= Iy(n) — Lan-(d)a(5). t 


Example 8.1.34 Xgjn | u(d) | = 2", where r is the number of distinct primes divid- 
ing n. 


Proof The map n ~»| j(1) | is clearly a multiplicative function. Thus, the map x 
defined by 


x(2) = Lain | wd) | 
is multiplicative. Further, 
x(p") = Layp> | wd) |= wd) + wp) + 04+ +--+ 0 = 2. 


The result follows from the fact that y is multiplicative. tt 


Example 8.1.35 Xajn (u(d))*(@(d))? = Whine ‘ eine —2p+2). 


Proof Since y and ® are multiplicative and the products of multiplicative functions 
are multiplicative, 
nn (uu(n))?(®(n))* is multiplicative. Hence, the function 77 defined by 


mn) = Lajn(u(d))*(®(d))" 
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is multiplicative. Thus, it is sufficient to show that n(p%) = p* — 2p +2 for all 
a> 1. Now, fora > 1, 


mp) = (uD (@M) + (up)? (@(p))?_ + 0---+ 0 = 
Le (pal = pr 2p 2, 


Exercises 


8.1.1 Show that f € A(N, C) is a unit if and only if f(1) 4 0. Find the inverse of 
f assuming that f(1) 4 0. 


8.1.2. Show that if f is a nonzero multiplicative function, then f(1) = 1. 
8.1.3. Show that inverse of a multiplicative function is also multiplicative. 
8.1.4 Show that inverse of in M(N, C) is {. 

8.1.5 Find the inverse of 7 in M(N, C). 

8.1.6 Find the inverse of o in M(N, C). 

8.1.7 Determine the inverse of ®. 

8.1.8 Find the number of divisors of 1000. 

8.1.9 Find the sum of divisors of 1538. 

8.1.10 Find the sum of squares of divisors of 1000. 


8.1.11 Let f be a multiplicative function. Show that the function h defined by 
A(n) = Lan fd) 


is a multiplicative function. 


8.1.12 Show that X2/,p(d) =| p(n) |. 
Hint. Observe that both sides are multiplicative and so verify it for p®. 


8.1.13 Prove the following generalizations of the results about the Euler’s phi func- 
tion to the function ®,. 


(i) n® = DajnO, (A). 
(ii) ®, is multiplicative. 


(iii) O(n) = nk “Nea aaeae! a or): 


8.1.14 Find the number of positive integers less than 1000 and co-prime to 1000. 
Find the number of positive integers less than 1000 and not co-prime to 1000. 


8.1.15 Find the number of pairs (a, 3) € N x N such that a < 100, @ < 100, and 
the g.c.d (a, 3, 100) = 1. Find the number of pairs (a, 3) such that a < 100, 6 < 
100, and (a, 3, 100) £ 1. 


8.1 Arithmetic Functions 277 


8.1.16 
8.1.17 
8.1.18 


8.1.19 
8.1.20 


Find £72 .y(n). 


Find &y/1000/4(d). 


Show that Zyjn(4)®(d) = [1 y/n.p a prime(2 — P)- 


wd) __ p-2 
Show that Lajn ®(d) —_ ee a prime p—I* 


Find the expression for 


(ud) 
Yajn (®@)? 


in terms of the primes dividing n. 


8.1.21 
8.1.22 


8.1.23 
8.1.24 
8.1.25 
8.1.26 


8.1.27 
[a, 6] 


8.1.28 
8.1.29 


8.1.30 


dyy2 
Show that Lad/n ue = Bw . 


Show that Lajn(7(d))?_ = (LajnT(d))’. 


Find E4/100 BOs. 

Show that 7 (7) is even if and only if n is not a square. 

Show that o(7) is even if and only if ./n and Je are irrational numbers. 
Show that ®(nm) = ®((n,m))- O([n, m)). 


Show that 7(n) is the number of pairs of positive integers (a, 2) such that 
= Nn. 


Show that A(n) = —Xg/np(d)logd. 
Let oo¢a(n) denote the sum of positive odd divisors of n. Show that 
n+d 
Foaa(n) = Lgyn(—l) 4 -d. 


Call a number 7 a perfect number if a(n) = 2n. Show that if 2” — 1 is 


a prime number, then 2”~!(2” — 1) is a perfect number. Conversely, Euler proved 
that every even perfect number is of this form. Thus, finding even perfect number is 
equivalent to finding primes of the form 2” — 1. Such primes are called the Mersenne 
primes. It is not known whether there are infinitely many Mersenne primes, or equiv- 
alently infinitely many even perfect numbers. It is also not known whether there are 
odd perfect numbers. 


8.1.31 


Let f: Nx N--C be a nonzero function. Define a function F from 


N x Nto C by 


F(m,n) = Xa /mad,/nf (di, a2). 


Prove the following inversion formula: 


f(m,n) = Layjmdejnf(d) (da) F(™, 2). 
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8.1.32 The function [] from the field R of real numbers to Z defined by [](x) = 
the largest integer < x is called the bracket function. The image of x is denoted 
by [x] and is called the integral part of x. Prove the following properties of the 
bracket function: 


Gi) O<x—[x] < 1.x — [x] is called the fractional part of x. 

Gi) [x +1] = [x]+1. 

Gii) The function f defined by f(x) = [x]+ [— x] is zero at integers and —1 at 
nonintegers. 

(iv) [x+y] < [x] + Ly]. 

(v) Ifa,meZandm > 0, thena — [F] -m is the remainder when a is divided 
by m. 

(vi) [x] + ly] + [x+y] s [2x]+ [2y]. 


8.1.33 Let n be a positive integer and p a prime. Let ord,n denote the exponent of 
the highest power of p dividing n. Show that 


ord,n! = ZiEal yl. 


8.1.34 Let ®(x, 1) denote the number of positive integers less than or equal to x 
and co-prime to n(x > 2). Show that 


La/nP (3, 9) = [x]. 


Deduce that 


P(x,n) = Lanp(a)l4l- 


8.1.35 Let f be an arithmetic function and the function F is given by 


F(n) = Xan J (d). 


Show that 
Trak (m) = Zl Sl fan). 


Deduce that 

Um=iT(m) = Xi [F]. 
8.1.36 Let f be a function from N to an abelian group (A, +). Let F be a function 
from N to A defined by 


F(n) = Xan J (d). 


Show that 
f(a) = Land) F (4). 
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If the operation of the group is written multiplicatively, then if 
Fi) = [],,, f@. 


fo) = [],,¢G”. 


8.1.37 Show that 


Deduce that i 
n ie 
= dia, 
n=TII,,, 


8.1.38 Let f(X) € Z[X]. Define an arithmetic function 7 by w s(n) = the number 
of integers f(m), 1 < m <n such that (f(m),n) = 1. Show that + is multiplica- 


tive and 
Wr (p) 
ok ) recs a prime P 


8.1.39 Let a be an integer co-prime to n. Show that 


dua) 
TH, d =1(mod n). 


8.1.40 Let (n) denote the number of positive integers m <n such that m? is a 
generator of the cyclic group Z,. Find a formula for 7(7). Calculate 7(100). 


8.1.41 Show that 


Limnsco(Zmat gs) = Limnoo(Znar Ga) 


m=1 ms m=1 ms 


fors > 1. 


8.2 Higher Degree Congruences 


Let m be a positive integer greater than 1. We have already discussed linear Dio- 
phantine equations, linear congruences modulo m, or equivalently, linear equations 
in Z,,. Now, we consider higher degree congruences 


f(X) = aox” + aX"! 4... + a, = O(mod m), 
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where a; € Z and m does not divide ag, or equivalently, the polynomial equations 


F(X) = GX" + |X"! + + Gy = 0, 


where ap 4 0 in Z,,. Let 
Qi Q2 Qa; 
m= P; Po ---Prs 


where p), P2,... p, are distinct primes and a; > 0 for all i. The solutions of the 
congruence 
f(X) = O(mod m) 


are precisely the solutions of the simultaneous system 
f(X) = OGnod pi"), fX) = O(mod p§?),..., f(X) = O(mod pe’) 
of congruences. Further, if u; is a solution of 
f(X) = O(mod p;"); i =1,2,..-7, 


then, since 
X =uj;(mod p;") => f(X) = f (uj)(mod p;") 


(for (X — u;) divides f (X) — f(u;)), we need to find solutions of the simultaneous 
system 
X = u\(mod p}'),..., X =u,(mod pr). (8.2.1) 


of congruences. Thus, the problem of solving higher degree congruences reduces to 
the following two problems. 


Problem 8.2.1 (i) Does there exists a solution of the simultaneous system 
X =u\(mod p}'),..., X =u,(mod pe) 
of congruences, where p), p2,..., Pp, are distinct primes? 
(ii) If a solution exists, is it unique modulom = pj'p;... pe"? 


(iii) How to find solutions if they exist? 
Problem 8.2.2 (i) Does there always exist solution of 
f(X) = O(mod p"), 
where p is prime? If not, under what conditions a solution exists? 


Gi) Ifa solution exists, then how many distinct modulo p® solutions are there? 
(iii) How to determine the solutions? 
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Solution to Problem 8.2.1 is in affirmative, and this is precisely the Chinese 
remainder theorem. 


Theorem 8.2.1 (Chinese Remainder Theorem) Let {m,,m2,...,m,} be a set of 
pairwise co-prime integers. Let a,, az,..., a, be integers. Then the system of con- 
gruences 


X =a,(mod m,), X = ao(mod m2), ..., X = a,(mod m,) 


has acommon solution. If a and b are any two common solutions of the above system 
of congruences, then a = b(mod m), wherem = m,m2...m,. 


Proof Putn; = a Since {m1,m2,...,m,} 1s a set of pairwise co-prime integers 
(mj,nj) = 1foralli =1,2,...,r. 
By the Euclidean algorithm, we can find integers u; and v; such that 
ujm,; + vin; = |foralli=1,2,...,r. 
Putb; = vjn;, i=1,2,...,r. Then, bj = 1(mod m;), and b; = O(mod n;). Thus, 
bj = \(mod m;), and bj = O(mod m,) forall j Ai, i=1,2,...,4r. 
Take a = a,b, + aob. +---+a,b,. Then, 
a-—a = aby +daobo +--+ aj;_ybj_-, + (Bb — Ia; +--+ +a,b, 
is clearly divisible by m; for all i. Thus, 
a = a(mod m;) forall i, 


and so a is acommon solution to the given system of congruences. 
Further, since {7 1, m2,...,m,} 1S a set of pairwise co-prime integers, 


a = b(mod m) if and only if a = b(mod m;) for all i. 
Thus, there is a unique solution modulo m of the above system of congruences. { 


Corollary 8.2.2 Let {m,,m2,...,m,} be a set of pairwise co-prime positive inte- 
gers. Let {a,, a2, ..., a,} be a set of non negative integers such that a;_< m,; forall 
i. Then, there is unique smallest non negative integer t such that if we divide t by m; 
the remainder is a; for all i. 
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Proof From the above theorem, we can find an integer a such that a = a;(mod m;) 
for all i. Divide a by m, and let t be the remainder. Then, 0 <t < mand 


t =a;,(mod m;) forall i. 


This means that ¢ is such that if we divide t by m; the remainder is a; for alli. Clearly, 
t is the smallest such nonnegative integer. tt 


Example 8.2.3 In this example, we illustrate the algorithm of chinese remainder 
theorem by means of an example. We find the smallest positive integer a such that 
if we divide a by 3 the remainder is 2, if we divide a by 5 the remainder is 1, and if 
we divide a by 7 the remainder is 3. We first find a common solution of the system 


X =2(mod 3), X = \(mod 5), X = 3(mod 7) 
of congruences. Here, m, = 3, m2 = 5, m3 = 7, gy = 2, & = 
a; = 3.Thus,n} = 5x7 = 35, mn. = 3x7 = 2l,andn3 = 3x5 = 15. 
Now, (3,35) = 1. Using the Euclidean algorithm, we find that 


1 = 12x3 + (-1) x 335. 


Thus, b} = —35. Similarly, by = 21 andb3; = 15. Therefore, a common solution 
is given by 

ab) + agbp +.43b3 = 2b, + 1bo + 3b3 = —4. 
To find the smallest nonnegative solution, we divide —4 by m = mim2m3 = 105 


and take the remainder. Since 
—4 = (-1) x 105 + 101, 


101 is the required solution. 


Corollary 8.2.4 Let {m,, mz, ...,m,} be a set of pairwise co-prime integers, and 
m = mym,...m,. Then 


Zm © Zin, X Zn, X +++ X Zn, 
as rings. 
Proof Define a map f from the ring Z of integers to Zm, < Zm, X ++: X Zm, by 
f@ = G@G@,...,a), 
where @ at the ith place is the residue class of a in Z,,,. It is straightforward to see 


that f is a homomorphism of rings. Let (a7, a, ...,a@,-) be an arbitrary element of 
Zm, X Zm, X +++ X Zy»,. By the chinese remainder theorem, there is an integer a 
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such that a = a;(mod m;) for all i. Thus, a = Gj in Z,,, for all i. This shows that f 
is surjective. Suppose that f(a) = (0,0,...,0).Then,a@ = Oin Z,,, for alli. Thus, 


m;/a for all i. Since m,,m2,...,m, are pairwise co-prime, m/a, and so a € mZ. 
This shows that ker f = mZ. The result follows from the fundamental theorem of 
homomorphism for rings. ft 


Corollary 8.2.5. Under the hypothesis of the above corollary, 
Un 7 Um, x Um » ae Um, - 


Proof We know that the group of units of product of rings is the product of group 
of units of each of them. The result follows from the above corollary, if we observe 
that U(Zm) = Um. tt 


Another proof of multiplicativity of ®. Let (m;,m2) = 1. Then, from the 
above corollary, Un ~ Um, X Um,. Comparing the orders, we get that ®(mm2) = 
P(m))P(m2). t 

In general, a system of congruences need not have any common solution. For 
example, congruences X = O(mod 4) and X = 1|(mod 6) have no common solution 
(prove it). The following theorem gives us a necessary and sufficient conditions for 
a system of congruences to have a common solution. 


Theorem 8.2.6 Let {m,,m2,...,m,} bea set of nonzero integers. Leta, a2, ..., Ay 
be integers. Then the system 


X =a,(mod m,), X = a)(mod m),..., X = a,(mod m,) 


of congruences has a common solution if and only if (m;,mj)/(a; — a;) for all pair 
(i, j). Further, if a solution exists, it is unique modulo [m,,mz2,..., my]. 


Proof Suppose that we have acommon solution a of the given system of congruences. 


Then, for every pair (i, j), 4 = a;(mod m;) and a =aj;(mod mj). Leta = aj + 
mk; = aj + mjk;.Then, qi -— aj = m jk; = m;k;. This shows that (mj, mj)/(ai = 
aj). 


Conversely, suppose that (m;,mj;)/(a; — aj) for all i, 7. We show that there 
is a common solution a to the system of congruences which is unique mod- 
ulo [m1,m2,...,m,]. The proof is by induction on r > 2. We are given that 
(m1, M2)/(a, — az). Any solution of X = a;(mod mj) is of the form a; + km ,. Since 
(m1, M2) divides (ay — a,), there exists ak such that a; + km, = a> (mod mp). Fur- 


ther, we know from the theory of linear congruences that k is unique modulo aa ; 


mym2 


and so a solution a; + km, is unique modulo mae = [m,, m2]. 
Assume that the result is true for r. Then, we prove it forr + 1. Let m,, m2, ..., 
m,+, be nonzero integers together with integers a),d2,...,d;+,; such that 
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(m;,mj)/(a; — a;) for alli, 7. Then, we have to show the existence of a common 
solution to the system of congruences 


X =a\(mod m,), X = an(mod m2), ..., X = a-41(mod m,;+1). 


From what we have shown above, it follows that there is a common solution a of the 
pair of congruences 


X =a,(mod mj) and X = az(mod m2) 
which is unique modulo [m,, m2]. Consider the system 
X =a(mod [m,,m2]), X = a3(mod m3), ..., X = a,4,(mod m,41). 
Already (m;,mj;)/(a4, aj), 3<i<rt+1, 3<j<rtl. Since m,/(a—a), 
(m1, m;)/(a — a). Also, since (m1, m;)/(a; — a;), it follows that (m1, m;)/(a — 


a;). Similarly, (m2, m;)/(a — a;). But, then it follows that ([m,, m2], m;) divides 
a — a;. By the induction hypothesis, there is a common solution of the system 


X =a(mod [m,,m2]), X = a3(mod m3), ..., X = a-41(mod m,;+1) 
which is unique modulo [[771, m2], m3,...,m;4i] = [m1,m2,...,m,41]. Since 
solutions of X = a(mod [m,, m2)) are also solutions of the system X = a;(mod mj), 
X = a)(mod mz) of congruences, the result follows. tt 


Example 8.2.7 Here, again, in this example, we illustrate the algorithm of the above 
theorem by solving for a common solution of the system 


X = |(mod 9), X = 7(mod 15), X = 2(mod 10) 
of congruences and getting the smallest positive solution. Observe that the hypothesis 
of the above theorem is satisfied. 
First consider the pair 

X = |(mod 9) and X = 7(mod 15) 
of congruences. Any solution of the congruence equation X = 1(mod 9) is of the 
form 1 + 9a for some a € Z. If 1 + 9a = 7(mod 15), then 9a = 6(mod 15) or 
3a = 2(mod 5). Thus, a = 4, and so 37 is acommon solution of the pair 

X = |(mod 9)and X = 7(mod 15) 


of congruences. Now, [9, 15] = 45, and so we try to get a common solution to pair 


X = 37(mod 45) and X = 2(mod 10) 
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of congruences. Any solution of X = 37(mod 45) is of the form 37 + 45, and if it 
is also a solution of X = 2(mod 10), then37 + 453 = 2(mod 10), or equivalently, 
458 = —35(mod 10) = 5(mod 10). This means that 9G = 1(mod 2). Thus, G = 1, 
and 37 + 45 x 1 = 82 isacommon solution to the given system of congruences. 
Since [9, 15, 10] = 90, 82 is the smallest nonnegative solution. 


Because of the Chinese remainder theorem, the problem of solving higher degree 
congruence modulo n reduces to the problem of solving higher degree congruence 
modulo p®, where p is a prime. We further reduce this problem to the problem of 
solving higher degree congruence modulo p. 


Theorem 8.2.8 Letm > 1 and pa prime. Let 

fQ) = m4 aX + @X? + =e + aX" 
be a polynomial in Z[X], where p™*! does not divide dn. If x is a solution to the 
congruence 


f(X) =0(mod p™*), 


then it is also a solution of 


f(X) = 0(mod p"). 


If 2 is a solution of 
f(X) = O(mod p”), 


thena = 8 + ¥y- p™ isa solution of the congruence 
F(X) = O(mod p"*') 
if and only if y is a solution of the congruence 


Df(8)-X = —L2 (mod p), 


p” 


where Df (X) denotes the derivative of the polynomial f (X) (see Exercise 7.6.27). 


Proof Clearly, a solution of 
f(X) = O(mod p™*') 


is also a solution of 
f(X) = 0(mod p”). 


Let ( be a solution of 
f(X) = 0(mod p”). 


Then, p’”/f(@). By the Taylor’s theorem (see Exercise 7.6.29), 
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2. nf 
f(B+ 1") = FB + yw™DF(B) + p"P 2G +--+ Gey. 
E ee are integers for k > 1 (Exercise 7.6.28). Thus, 


f(B + yp") = (F(B) + yp" Df (8))(mod p”*"'). 


Hence, 
f(8 + yp") = 0(mod p™*') 


if and only if 
f(8) + yp" Df(B) = O(mod p™""), 


or equivalently, 
(Df (8) -7 = —4r(mod p). 


Corollary 8.2.9 The number of mod p™*! distinct solutions of 
f(X) = O(mod p™*") 


which are congruent to 3 modulo p" is 


(i) 0, if p divides Df (2) but it does not divide a. 


(ii) p, if p divides Df ({), and it also divides i, and 
(iii) 1, otherwise. 


Proof We know that aX = b(mod m) has no solution if (a, m) does not divide J, it 
has (a, m) distinct modulo m solutions if (a, m) divides b. The result follows from 
the above theorem. tt 


Remark 8.2.10 Theorem 8.2.8 gives us a method by which, knowing the solutions 
of f(X) = O0(mod p), we can determine (by induction) all solutions of f(X) = 
O(mod p’”), m € N. Thus, the problem of solving higher degree congruences reduces 
to the problem of solving them modulo different primes. 


Example 8.2.11 We illustrate the algorithm of the above theorem and corollary by 
solving the congruence 


X? + X? + X + 1 =O0(mod 27). 
First, let us look at the solutions of 
xX? + X? + X¥ + 1 = OGnod 3). 


Note that we have not yet described an algorithm to solve a higher degree congruence 
modulo p. However, for small p, we can try all integers modulo p. 
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We see that out of 0, 1, 2, only 2 is the solution of the above congruence. Thus, 
by the above theorem, 2 + y- 3 is a solution of 


Oe Ae = Oed S) 
if and only if 7 is a solution of 
DO? + X* 4+ X + DQ). xX =—2+22 +1 mod 3). 
Thus, yy should be a solution of 
17X =—S(mod 3) 


or equivalently 
2X = |(mod 3). 


Thus, y = 2 is the unique solution. Hence, 2 + 2 x 3 = 8 is the unique solution 
modulo 9 of congruence 


Pay Ys 1 SOned 9), 
Further, let 8 + 6-32 bea solution of 
Xb Xe XY | 1 j]OGned 3"), 
Then, again from the above theorem, 6 should be a solution of 
D(X? + X? + X + 18) = —- E+E 8+! (nod 3) 
or equivalently, 6 should be a solution of 
205X = —45(mod 3). 
This, in turn, means that 6 should be a solution of 
2X = 1|(mod 3). 


This gives the unique 6 = 2(mod 3). Hence, 8 + 2 x 3? = 26 is the unique 
solution modulo 27 of the given congruence. 


Example 8.2.12 We use the algorithms in this section to solve the congruence 


f(X) = 0(mod 135), 
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where f(X) = X? + X? + X + 1.Now, 135 = 3? x 5. Thus, we first solve 
the congruences 
f(X) =0(mod 33) (8.2.2) 


and 
f(X) = O(mod 5) (8.2.3) 


We have already seen in the above example that 26 is the unique solution of 1. 
Further, we find that 2, 3, and 4 are three distinct solutions modulo 5 of the congruence 
equation 8.2.3. Thus, there are three distinct (modulo 135) solutions of the congruence 


f(X) = 0(mod 135) 


which are common solutions of the following three pairs of congruences: 


1. X = 26(mod 27), X = 2(mod 5). 
2. X = 26(mod 27), X = 3(mod 5). 
3. X = 26(mod 27), X = 4(mod 5). 


Using the Chinese remainder theorem, we find that the solutions are 107, 53, and 
134. 


Exercises 
8.2.1 Give an example of a system of congruences which has no common solution. 


8.2.2 Find the smallest positive integer x such that if we divide x by 7 the remainder 
is 2, if we divide x by 10 the remainder is 3, and if we divide x by 9 the remainder 
is 4. 


8.2.3 Find the smallest positive integer x such that x = 3(mod 10), x = 7(mod 22), 
and x = —4(mod11). 


8.2.4 Find all distinct solutions modulo 343 of the congruence 
X? + 2X? + 3X + 35 =0(mod 343). 

8.2.5 Find all distinct modulo 49 x 17 = 833 solutions of the congruence equation 
x? 4 ON? 4 BX 4-95. = OGnod $33). 

8.2.6 Let f(X) € Z[X] be such that f(n) is eventually prime in the sense that 


there exists ang € N such that f(n) is prime for all n > ng. Show that f(X) is a 
constant polynomial. 


$.2.7 Letm = pj p,’ ..«p%, where pi, po, ...; Pr 
are all distinct primes. Suppose that 
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f(X) = 0(mod p*') 

has n; distinct (modulo D;") solutions. Show that 
f(X) = O(mod m) 


has nj -N2----+n, distinct (modulo m) solutions. 


8.2.8 Suppose that p is a prime which does not divide a, and also it does not divide 
n. Show that X” = a(mod p") has a solution if and only if X"” = a(mod p) has a 
solution. 


8.2.9 Let m € N be such that U,, is cyclic group. Let a € Z such that (a,m) = 1. 
Show that X” = a(mod m) is solvable if and only if 


@(m) 
q (1,P(m)) = 1(mod m). 


In particular, if p is a prime, then 


X" =a(mod p) 


has a solution if and only if 


p-1 
a™P-\) = (mod p). 


8.3 Quadratic Residues and Quadratic Reciprocity 


In Sect.8.2, we observed that to solve a higher degree congruence modulo m, we 
need to evolve a method to solve higher degree congruence modulo p, where p is a 
prime. In this section, we study quadratic congruences modulo p. 

There are two cases: 


(i) p = 2. 
(ii) p is an odd prime. 


Consider the case when p = 2. Let f(X) € Z[X] be a quadratic polynomial. 
Then, there are three possibilities 


(a) f(X) = (X? + X + 1)(mod 2). 
(b) f(X) = (X* + X)(mod 2). 
(c) f(X) = (X? + 1)(mod 2). 


If f(x) = (Xx? + X + 1)(mod 2), then Ff (X) = 0(mod 2) has no solution, for 
2 divides x? + x for all x € Z. If f(xoy= (X? + X)(mod 2), then 0 and 1 both 
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are solutions. Finally, if f(X) = (X 2 + 1)(mod 2), then 1 is the unique solution of 
F(X) = 0(mod 2). 

Now, we assume that p is an odd prime. Let 


F(X) =a + aX + aX? 


be a polynomial in Z[X] of degree 2 such that p does not divide a2. Since (p, az) = 
1, there exists a bz such that (b2, p) = 1, and bya, = 1(mod p). Further, since 


(b2, p) = 1, 
f (X) = bof (X)(mod p) = (X? + bra,X + bza)(mod p). 
Thus, any quadratic polynomial 
F(X) = ay + aX + aX’, 
where p does not divide az is congruent to a polynomial of the type 
xX? +bKX +c 


modulo p. Since p is an odd prime, (p,2) = 1 = (p,4). Thus, there exists an 
integer d such that d - 2 = 1(mod p), d?-4 = 1(mod p), and (d, p) = 1. Now, 


X? + bX + c= (X*? + 2dbX + b’d* — b?d? + c)(mod P= 
((X + bd)? + c—b’d?)(mod p). 


Thus, the solutions of 
X*? + bX +c =O(mod p) 


are same as the solutions of 
(X + bd)? = (b’d? — c)(mod p). 
This has solution if and only if 
Y* = (b’d* — c)(mod p) 
has a solution, and if u is a solution, then u — bd is a solution of 
ae ae O(mod p). 


Observe that there are at most two distinct solutions mod p. This motivates to have 
the following definition. 


Definition 8.3.1 Let m € N, m > 2. Let a € Z be such that (a,m) = 1. We say 
that a is a quadratic residue mod m if 
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X= a(mod m) 


is solvable. 


The following proposition follows as particular case of the discussions in the 
beginning of Sect. 8.2 of this chapter. 


ay a2 


Proposition 8.3.2. Letm = p\'p,’... p-’, where pi, p2, ... py are distinct primes, 
and a; = 1 foralli. Then a is a quadratic residue mod m if and only if a is quadratic 
residue mod p;" for all i. ¢t 


Proposition 8.3.3 Let p be an odd prime, and a an integer co-prime to p. Then a is 
a quadratic residue modulo p”, m > \ if and only if a is quadratic residue modulo 
p. Further, then there are only two distinct solutions modulo p™” of 


X? = a(mod p”). 


Proof The proof is by induction on m. If m = 1, the conclusion is the same as the 
hypothesis. Assume the result for m. We have to prove it for m + 1. Thus, X? = 
a(mod p") has a solution if and only if X 2 =a(mod Pp) has a solution, and then, 
there are exactly two solutions modulo p”. Note that X* = a(mod p’) means that 
(Xx? — a) =0(mod p’"), and also X? =a(mod p”*!) means that (xX? — a= 
O(mod p'*'). Applying Theorem 8.2.8 to the polynomial X?_ — a, we observe 
that corresponding to every solution 3 of (X* — a) = 0(mod p'”), there is a unique 
solution of (X? — a) = 0(mod p”*') if and only if p does not divide Df (3) = 28. 
Since 3 is supposed to be a solution of (X* — a) =O(mod p'"), and p does not 
divide a, it follows that p does not divide (3. Since p is an odd prime, it does not 
divide 2/3. The result follows by induction. tt 


The following proposition takes care of the case p = 2. 
Proposition 8.3.4 Leta be an odd integer. Then 


(i) X? =a(mod 2) has a unique solution mod 2. 
(ii) X? =a(mod 4) has a solution if and only if a = \(mod 4), and then it has 
exactly 2 distinct solutions mod 4. 
(iii) X? = a(mod 2”), m > 3 has a solution if and only if a = \(mod 8), and then 
it has exactly 4 distinct solutions mod 8. 


Proof (i) Clearly, 1 is the unique solution of X 2 = a(mod 2). 
(ii) If b is a solution of X? = a(mod 4), then since a is odd, b is also odd. Suppose 
thatb = | + 2/. Then, 


bP =1+4+ 414+ 4? =a(mod 4), 


and hence, 
a = |(mod 4). 
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Further, then out of 4 residue classes modulo 4, only I(mod 4) and 3(mod 4) 
are the solutions of 

X? = a(mod 4) = 1(mod 4). 

(iii) Suppose thatb = 1 + 21 is a solution of 


X? = a(mod 2"), m > 3. 


Then, 
1+ 41? + 41 =a(mod 2"), m>3 


or 
1 + 4142 +1) =a(mod 2”), m > 3. 


This implies that 
1 + 4172 +1) =a(mod 8). 


Since 2//(1 + 1), 
a = |(mod 8). 


Further, suppose that a = |(mod 8), then out of 8 residue classes modulo 8, 
1(mod 8), 3(mod 8), 5(mod 8), and 7(mod 8) are 4 solutions of 


X? =a(mod 8) = |(mod 8). 


Also, each solution of 
X*? =a(mod 2"), m>3 


is also a solution of 
X? = a(mod 8). 


Conversely, we show that each solution of 
X* = a(mod 8) 
determines a unique solution of 
X? = a(mod 2"), m > 3. 


This we prove by induction on m. If m = 3, there is nothing to do. Assume the 
result for m. Let b be a solution of 


X? = a(mod 2”). 


Then, b + 2”! is a solution of 
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X? = a(mod 2"*') 


if and only if 
(b + d2"-1)? = a(mod 2”) 
or 
be + 27"? + 2"\b = a(mod 2”*") 
or 


2"b = (a — b*)(mod 2"*'), 


Since 2” /(a — b?), 


Nb = 5" (mod 2). 


Since b is odd (for a is odd), there is a unique A(mod 2) satisfying the above con- 
gruence. Thus, corresponding to each solution of 


X? = a(mod 2"), 
we have a unique solution of 
X* = a(mod 2"). 
The result follows from the principle of induction. tt 


Theorem 8.3.5 (Euler’s Criterion) Let p be an odd prime, and a an integer such 
that p does not divide a. Then a is quadratic residue modulo p if and only if 


p—1 


! 
a 2 =I(mod p). 


Proof Suppose that a is a quadratic residue mod p. Then, there is an integer b such 
that 
b* =a(mod p). 


Since (a, p) = 1, (b, p) = 1. By the Fermat’s theorem, 
b?-| = 1(mod p). 


Hence, 
p-l 


p-l 
a2 =(b*) 2 (mod p) =b?~'(mod p) = \(mod p). 


Next, suppose that a is not a quadratic residue mod p. Let b be a positive integer less 
than p and so co-prime to p also. Then, 
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bX =a(mod p). 
has a unique solution modulo p. Since a is not a quadratic residue modulo p, this 


solution can not be b(mod p). Thus, there is anelementc 4b, | <c < p—1such 
that 


bc = a(mod p). 
This shows that we can pair the set {1,2,..., p — 1} such that product of any pair 
is congruent to a(mod p). Taking the product of 1, 2,..., p — 1, we obtain that 


(p-)! = ae Gea p)- 


By the Wilson theorem, (p — 1)! = (—1)(mod p). Hence, if a is not a quadratic 
residue, then 


p-1 
a 2 =(-1)(mod p). tt 


The following corollary follows from the previous 4 results. 


Corollary 8.3.6 Letm = 2” p5? p;°... p°", where p2, p3,..., py are distinct odd 
primes, a, = 0, and a; => | foralli > 2. Let a be an integer such that (a,m) = 1. 
Then a is a quadratic residue modulo m if and only if 


(i) a'r = (mod p;) forall i > 2. 
(ii) a = 1\(mod 4) ifa, = 2, anda = \(mod 8) if ay => 3. 


Further, 
X= a(mod m) 
has 2'—' solutions, if ay <1, 2” solutions, if a; = 2, and 2'+! solutions, if 
a, > 3. t 


The rest of the section is devoted to develop a method by which we can determine 
whether a given integer a is a quadratic residue mod p, p an odd prime. 


Definition 8.3.7 Let a be an integer and p a positive prime. Define the Symbol CG) 
as follows: 


(i) ) = lif p does not divide a, and a is quadratic residue modulo p. 
(ii) G) —1if p does not divide a, and a is not a quadratic residue modulo p. 
(iii) ee = Oif p divides a. 


Let P denote the set of all positive primes. The map (—) from Z x P to {1, 0, —1} 
defined by (—)((a, p)) = CG) is called the Legendre symbol map. 


Proposition 8.3.8 Let p be an odd prime. Then 
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(i) at = (4)(mod p) for alla € Z. 
(ii) a = b(mod p) implies that Ce = (2) for all a, beZ. 
(iii) (2) = (£)- (4) foralla,b € Z. 


Proof If p/a, then at = O(mod p), and by the definition, ee = 0, and so in this 


case, ar= (5) (mod p). Suppose that p does not divide a. Then, by the Euler’s 
theorem, we have the following: 
(a) If a is a quadratic residue mod p, then a = l(mod p). 


P= 


(b) If a is not a quadratic residue mod p, then a T= l(mod p). 
q P P 


By the definition of the Legendre symbol, it follows that 


Ge = (5) (mod Dp). 


(i) Since a = b(mod p) ifand only ifat = pr (mod p), the result follows from 
part (i) 

(ii) (ab) = a'r br = (4)(4)(mod p). Also, (ab) = (2)(mod p). Since 
the absolute value of any Legendre symbol is less than or equal to 1, the result 
follows. tt 


The following corollary follows from the part (ii) of the above proposition. 


Corollary 8.3.9 ee = (5) where r is the remainder obtained when a is divided 

by p. t 
The following corollary is immediate from part (iii) of the proposition. 

Corollary 8.3.10 Product of any two quadratic residue mod p is a quadratic residue 


mod p. Product of any two nonquadratic residue is a quadratic residue. Product of 
a quadratic residue and a non quadratic residue is a nonquadratic residue. tt 


Corollary 8.3.11 We have a surjective homomorphism from Z*, = Uy, to the group 
{1, —1} given bya ~ (5) 


Proof Suppose that a = b. Then, a = b(mod p), and so (3) — (2). Hence, we 
have a map given by a ~ (5). Also since 


(2) = (4)(), 


p—-1 
2 


distinct members a of U, 


the map is a group homomorphism. Clearly, () = 1. Further, there are distinct 


p-1 
2 
for which (3 = |. For the rest of ao elements a of U,, 9) = —1. Hence, the 


given map is surjective. ft 


p—1 _ 
solutions of X‘T = TinU p- Hence, there are only 
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Corollary 8.3.12 (+) = (-— ye. In particular, —\ is a quadratic residue mod p 
if and only if p is of the form 4n + 1. 


p-l p-l 


Proof From the Euler’s theorem, (=) = (—1)7. Further, (-1) 7 = 1 ifand 
only if p = 4n + 1 for some n. tt 


Proposition 8.3.13 There are infinitely many primes of the form 4n + 1. 
Proof 5 is a prime of the form 4n + 1. Suppose that there are only finitely many 
primes pj, po,..., py of the form 4n + 1. Consider 


a = (2pip2... py)? + 1. 


By the fundamental theorem of arithmetic, there is a prime p dividing a. Clearly, p 
is odd, and 
(2pip2... py)? =—I1(mod p). 


This shows that —1 is a quadratic residue modulo p. By the above corollary, p 
is of the form 4n + 1. Clearly, p € p; for all i. This is a contradiction to the 
supposition. ft 


Lemma 8.3.14 (Gauss Lemma) Let p be an odd prime, and a be an integer such 
that p does not divide a. Consider the group U, of prime residue classes modulo p. 
Let 


and 

Then 
a4) — (_1)laxn yl 
(2) = (1x, 


Proof We have 


x U Y = Uy. (8.3.1) 
and 
-Y=xX. (8.3.2) 
Also, 
aX = aX(\U, = @X{)X)UJ@X{)¥). (8.3.3) 
By (8.3.2), —(@X (| Y) © X. Leth € @X (1) X) (1) —@X ()Y). Then, b = amy = 
—amz, where 1 < m,,m < >". But, then a(m; +m) = 0. Since (a, p) = 


1, my +m, = 0. Also, 2 < m, +m 2 < p — 1. This is a contradiction. Thus, 


(@X (|X) (\-@x(¥Y) = & (8.3.4) 
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Counting the number of elements, we get that 
X = aX a X) U —(aX a ae 


Taking the product of the elements of both sides 


a 1 ———— 


1-2-...2= = 1.2..-. 


et 
pal gz. (—1exnn, 


Hence, 
a. p-1 —___ 
l=a2 -(-1)e@*Ny"l 


or equivalently, 


1 SS 
a2 = (—DiaexN¥1, 


Thus, 
p—1 


L _ 
az =(-1)**"lqnod p). 
By the Euler’s theorem, the result follows. t 


Corollary 8.3.15 (4) = lea 


Proof Let X and Y be as in the above lemma. Then, by the Gauss lemma, 
G= Gy, 
where \ is the number of elements in the set 2X (| Y. Now, 


2X(\¥ = Qm| 2 <m< 5}. 


Thus, \ is the number of integers m such that eo <m< pe. Since p is an odd 
prime, one and only one of the following four cases hold. 


(i) p = 8 +1, k>1. 
(ii) p = 8k + 3, k>0. 
(iii) p = 8k + 5, k>0. 
(iv) p = 8K +7, k>0. 


Consider the case (i). In this case, \ is the number of integers m such that Seyiat < 


m < “41 This is clearly 2k. Thus, in this case (2) = (—1)** = 1. Also in this 


case, (—1) Fe i, Therefore, in case (1), the result is true. 
In case (ii), p = 8k + 3, and then, \ is the number of integers m such that 
2k + 5 < m<4k + 1. Thus, \ is the number of integers m such that 2k + 1 < 
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m <4k + 1.Inthis case,’ = 2k + 1, andso (2) = (—1)**+! = —1, Further, 


8k-+3)2—-1 : a ‘ . 
(— 1) 53 = —1. Therefore, in case (ii) also, the result is verified. 
8k +4 


In case (iii), p = 8k + 5, and A is the number of integers m such that ~j < 
m <4k + 2.Thus, is the number of integers m such that2k + 1 < m < 4k 4 2. 


(8k4+5)2=1 


Thus, in this case, 1 = 2k + 1, and (2) = —1 = (-1) 3. Hence, in this 
case also, the result is true. 
Finally, consider the case (iv) when p = 8k + 7. In this case, \ is the number 


of integers m such that < m<4k + 3 which is the same as the number 
of integers m such that 2k + 2<m<4k + 3. Clearly, \ = 2k + 2, and so 


(8k+7)2=1 


(2) =1z=(-l)''3 — (-1)"*. This completes the proof. tt 


8k + 6 
4 


As an application of the above result, we have the following: 


Corollary 8.3.16 There are infinitely many primes of the form 8k + 7. 


Proof Clearly, 7 is a prime of this form. Suppose that there are only finitely many 
primes pj, p2,..., py of the form 8k + 7. Considera = (4p\p2...p,)* — 2. 
Clearly, 2 is a quadratic residue modulo every prime divisor of a. Obviously, there 
are odd prime divisors of a. It is clear from the proof of the above corollary that 2 is 
a quadratic residue modulo an odd prime p if and only if p is of the form 8k + 1 or 
itis of the form 8k + 7. Since product of numbers of forms 8k + 1 is also numbers 
of same form, if all odd prime divisors of a are of the form 8k + 1, then 


a = l6p{p3...p? — 2 = 2(81 + 1) 


for some /. This implies that 8t2 — 1 = 8/ + 1,wheret = pip2...p;. This isa 
contradiction, for it would mean that 8 divides 2. Hence, not all odd prime divisors 
are of the form 8k + 1. Thus, there is a prime divisor of a of the form 8k + 7. 
Clearly, this prime is different from p;, p2,... p-. This again is contradiction to the 
supposition that there are only finitely many primes of the form 8k + 7. tt 


Remark 8.3.17 The above result is a particular case of a more general theorem due 
to Dirichlet which says that if a and b are co-prime, then there are infinitely many 
primes of the form ax + b. 


Corollary 8.3.18 Let p be an odd prime. Then, 2 is a generator of U, in each of the 
following two cases: 


(i) p is of the form4q + 1, where q is an odd prime. 
(ii) p is of the form2q + 1, where q is a prime of the form 4k + 1. 

Also —2 is a generator of U,, if p is of the form 2q + 1, where q is a prime of 
the form 4k + 3. 


Proof p = 4q + 1. Let m be the order of 2 in U,. Then, m/(p — 1). Since g is 
an odd prime, m = 2,4,q,2q or 4q.Ifm = 2, then = l(mod p). But, then 
p = 3 is not of the form 4g + 1.Ifm = 4, then 24 = 1(mod p). But, then 
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= Sis not of the form 4g + 1 with g isa prime. Ifm = q orm = 2d, then 


= l(mod p). But (2) — (-1) 5 = —1. This is a contradiction to 
the Euler’s theorem. This shows that m = p — 1. The rest of the corollary can be 
proved similarly and is left as an exercise. ft 


The following theorem known as Gauss Quadratic Reciprocity Law is 
extremely useful to calculate the Legendre symbol and, in turn, to determine whether 
an integer is quadratic residue modulo m. This is the Golden theorem of Gauss. There 
are several proofs of the theorem. We give a combinatorial proof. 


Theorem 8.3.19 (Gauss Quadratic Reciprocity Law) Let p and q be odd primes. 
Then 


p-1 
P) (4) — gL 
()-(@) = Cn? -s 
Proof By the Gauss lemma, 


=e 


and 
A=Ene™, 


where 


are subsets of U,, 


and 


fs pe ag = 


are subsets of U;. Put’ =| q@X (| Y | andy =| pZ()T |. Itis sufficient to show 
that 
A+ p= &. £*(mod 2). 


Let 
-1 
= Re ee 
and 
B = {1,2,..., 5+}. 
Let 


Dp 


= {(x, y)€ Ax B|-% < qx—py < O}, 
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QO = {(x,y)€ Ax B| -£ < py—qx < 0}, 
R = {(x,y)€ Ax B|qx-py < ==}, 


and 
S = {(x,y)€ Ax B| py—qx < —4}. 


Clearly, the sets P, Q, R, and S are pairwise disjoint. We show that 


Ax B= P\JolUJRUs 


It is easily observed that 


PUJoURUs cAxe. 


Let (x, y) € A x B. Then, gx — py € 0 for otherwise x has to be at least p and y has 
to be at least g. Also, since gx — py is aninteger and p and gq are odd, qx — py # He 
and qx — py A +4. Thus, 


(qx — py) € (-00, -$) J(-$.0 UJ@, $) UG, 0). 


Now, 
(qx — py) € (-~w, —$) = G@ YER, 
(qx — py) € (—§, 0) => @, y) e€ P, 
(qx — py) € (0.4) = @, y)E Q 
and 


(qx — py) € (£,~) => (x, y) ES. 


This shows that 


AxB= P\\0\ | | 18. 


In turn, 
Pt =| PL + 1Q1+ 1 RI+ 151. (8.3.5) 
Now, suppose that (x, y) € P.Then,—% < qx — py < Oorequivalently 7+" < 
qx — py + p < p—1. Hence, gx = qx — py + p belongs to Y. Thus, we have a 
map ¢ from P to gX (| Y defined by 


P(X, y)) = gx. 
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Suppose that d((x, y)) = ((x’, y’)). Then, g¥ = qx’. This means that p divides 
qx — qx’. Since q is co-prime to p andl <x <p—1,1<x'<p-l,x =x’. 
Hence, (x, y), (x, y’) € P. But, then —? < (gx — py) < Oand—4 < (qx—- 
py’) < 0 or equivalently 


q q 1 q y q 1 
ty < < +x 5andtx < < +x 5. 
7 y z + 5 : y = + 5 


Hence, y = y’. This shows that ¢ is injective. 
Let gx € GX (| Y, where 1 <x < Po Then, there exists r such that gx = 7, 


where aot <r<p-—1.Sincegx = r = r-— p, there exists a y € B such that 


qx — py = r-— p. Clearly, then 


— 5 < (qx— py) < 0, 


and so 
a g 1 
eo pe oF 3: 


Already, 1 < x < ae l<y< aot This shows that (x, y) € P, and @((x, y)) = 
qx. Thus, ¢ is also surjective. Hence, 


ASPs (8.3.6) 


Similarly, 
pb =|Q|. (8.3.7) 


Consider the map 7) from A x B to itself defined by 


"Gy = CP - 2,4" =». 
This is clearly a bijective map. Also, 


(x, y) € R ifand only if (py — qx) > &, 


and 


(py — qx) > & ifand only if p(&* — y) — q(@4Ae-x) = 
ot = py gs) = =F. 


Thus, 
(x, y) € R ifand only ifn((x, y)) € S. 


This shows that 
|R|=|S|. (8.3.8) 


From (8.3.5) to (8.3.8), we obtain that 
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po. dt = (A + p)(mod 2). 


The result follows. tt 


The following example illustrates an algorithm to determine the Legendre symbol 
) and thus to determine whether a is a quadratic residue mod p or not. 


Example 8.3.20 We determine the Legendre symbol (223) and thereby determine 


whether 2125 is a quadratic residue modulo 641 (observe that 641 is a prime number 
which is the smallest prime dividing 2 4 1). Also, we find the remainder when 
2125°° is divided by 641. 

If we divide 2125 by 641, the remainder is 202. Hence, by Corollary 8.3.9, 


Cosi) = (2). By Proposition 8.3.8(iii), we have (277) = (2,)(#1). By Corollary 
—? 6417-1 
@=CD & =1. 
Hence, 
(235) = (. (8.3.9) 
By the Gauss quadratic reciprocity law, 
101-1 641-1 
).(4) = Cnr 2 =1. 
Hence, 
101 641 35 5 7 
Cai) = Gor) = Gor) = Gor: Gor: (8.3.10) 
Again by the quadratic reciprocity, 
oe ee eta 2 
Go): () = CD = 1. 
Hence 
5 101 1 
Gp =(QI=@Q=1. (8.3.11) 
By the quadratic reciprocity, 
7-1 101-1 
GG) = GD? F =1 
Hence, 
(gr) = CY) = G). (8.3.12) 
Also 
3) 7 Ele 
@-@ = (prt =-1 


Hence, 
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G) = -G) = -G) = -1. 


Thus, 
(o) = HL. (8.3.13) 


Substituting the value of (3) and (eo) in Eq. 8.3.10, we get 


101 
From Eq. 8.3.9, we get 
2125 
wa 


This implies that 2125 is not a quadratic residue modulo 641. By Proposition 8.3.8 
(i), 


641-1 
(2125) 2 = (225)(mod 641) = —1(mod 641) = 640(mod 641). 


Thus, if we divide (2125)>”° by 641, the remainder is 640. 
Proposition 8.3.21 Let a be an integer and p an odd prime such that p does not 
divide a. Then 

aX” + bX +c =O(mod p). (8.3.14) 
has a solution if and only if b? = 4ac(mod p), or else 

b?—4ac\ __ 
(Faaey = 1. 
Further, in the second case it has two distinct solutions. 
Proof Since p does not divide a, there is an integer u co-prime to p such that 
ua = 1(mod p). (8.3.15) 

and so the solutions of the given Eq. 8.3.14 are same as those of 


uaX*? + ubX + uc = O0(mod p). 


The given Eq. 8.3.14 has a solution if and only if the above equation has a solution. 
Since ua = |(mod p), 


uaX*? + ubX + uc = (X* + ubX + uc)(mod Pp). 
Thus, the given Eq. 8.3.14 has a solution if and only if 


X* + ubX + uc =O0(mod p). (8.3.16) 
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has a solution, and then, they have same solutions. Since p is an odd integer, there 
is an integer v such that 


2v = |(mod p). (8.3.17) 
But, then 
X? + ubX + uc =(X* + 2vubX + uc)(mod p). (8.3.18) 
Now, 
X? + 2vubX + uc = (xX + vub)” _ (v*u7b? — UC). 
Thus, 


X? + 2vubX + uc = O0(mod p) 
has a solution if and only if 
Y? = (v7u7b* — uc)(mod p) (8.3.19) 


has a solution, and then, they have same number of solutions. Now, Eq. 8.3.19 has 
unique trivial solution if 


rr = w= O(mod p), 
and has two distinct solutions if and only if 
a) = 1. (8.3.20) 
From (8.3.15), a7u? = 1(mod p), and from (8.3.17), 4v? = 1(mod p). Hence, 
4v°a?u?b* = b*(mod p), (8.3.21) 


and 
4a*uc = 4ac(mod p). (8.3.22) 


Also, since 4a? is co-prime to p, from (8.3.21) and (8.3.22), we see that 
(v-u-b? — uc) = (b* — 4ac)(mod DP). 


It follows that (8.3.19) has a unique solution if and only if b? = 4ac(mod p), and it 
has two distinct solution if and only if 


(z — ae) = 1. t 


Since (8.3.14) and (8.3.19) have same set of solutions, the result follows. 
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Example 8.3.22 To determine the number of solutions of 
X? = 13(mod m) (8.3.23) 


where m = 2? x 3? x (17)3. Since 13 = 1(mod 4), from Corollary 8.3.6, it follows 
that (8.3.23) has a solution if and only if 


3-1 
(13) 2 = 1(mod 3), 


and 17-1 
(13) 2” = 1(mod 17). 
Equivalently, 
(3) = 1 = (). 
Now, 


()=@s1 
By the Gauss law of quadratic reciprocity, 


13 7 iS 
dD = Cpr? =1. 


Hence, 


G = =@ = G4’ =1. 


Thus, X? = 13 (mod m) has a solution, and again by Corollary 8.3.6, there are B= 8 
distinct solutions modulo m. The reader is asked to find them. 


Example 8.3.23 Let p be a prime dividing X* — X + 1. Then, p = I(mod 12) 
or p = 7(mod 12). 


Proof Since X27 — X = X(X — 1) is even, X? — X + 1 is always an odd 
integer. Hence, any prime p dividing X7 — X + 1 is an odd prime. Let p be a 
prime dividing X* — X + 1, then since p is odd, (2, p) = 1. Hence, there is an 
integer / such that 

21 = |(mod p). (8.3.24) 


In turn, 
(CP =k 4 1) = = 2X4 Did p). 


Thus, p divides X? — 21X + 1also. Since 


= 3 1 =] (HP = FHT, 
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(X — 1)? = (? — 1)(mod p). 


This shows that /? — 1 is a quadratic residue modulo p, and hence, 


Since (4) = (3)? = 1, we have 


a 
GE) = 1. 


Since 2/ = 1(mod p), (41? — 4) = —3(mod p). Hence 1 = (3) = (>) Ci 


Thus, 
p-1 


=Cy=Cp2. (8.3.25) 


By the Gauss quadratic reciprocity law, 


p-l| 3-1 


-1 
@)-@ = Cyt Ft =Cy? =. 


Hence 
(f) = 1. (8.3.26) 


Since p is an odd prime, p = 1(mod 3) or p = 2(mod 3). Since (3) — ei = 
—1, we have 
p = |(mod 3). (8.3.27) 


Again, since p is an odd prime, p = |(mod 4) or p = 3(mod 4). In case p = 
l(mod 4), 4/(p — 1), and also by (8.3.27), 3/(p — 1). But, then 12 divides p — 1. 
This means that p = 1(mod 12). In case p = 3(mod 4), since p = |(mod 3), by 
the chinese remainder theorem, p = 7(mod 12). tt 


Example 8.3.24 Let Z x Z denote the set of points on the plane with integral coor- 
dinates. Then, the parabola 
xX? + 19Y = 2 


does not intersect Z x Z. 


Proof To say that the given parabola intersects the lattice Z x Z is to say that X* + 

19Y = 2hasanintegral solution. Equivalently, X* = 2(mod 19) will have solutions. 

But r 

(19)?-1 
8 


(4) = (-D = -l. dt 
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Example 8.3.25 To determine primes p for which 10 is a quadratic residue modulo 
p. To say that 10 is a quadratic residue modulo p is to say that 


Thus, 10 is a quadratic residue modulo p if and only if one of the following holds. 
) (2 — (5 
® @ = 1 = G@). 

(i) 2) = -1 = (). 


Consider the case (1): 


(2 


P 


) = 1 ifand only if p = \(mod 8) or p = 7(mod 8). (8.3.28) 


By the Gauss quadratic reciprocity law, 


p-l 5-1 


@-=Cyp?r? =1. 
Hence, 
) _ (2). (8.3.29) 
Thus, 
(5) = 1 ifand only if () = 1. 
Now, 


(2) = | ifand only if p = \(mod 5) or p = 4(mod 5). (8.3.30) 
By the Chinese remainder theorem, 
[p = |(mod 8) and p = \(mod 5)] if and only if p = \(mod 40), 
[p = |(mod 8) and p = 4(mod 5) if and only if p = 9(mod 40), 
[p = 7(mod 8) and p = \(mod 5)] if and only if p = 3\(mod 40), 
and 
[p = 7(mod 8) and p = 4(mod 5)] if and only if p = 39(mod 40). 


Thus, in case (i), p is congruent to 1, 9, 31, or 39 modulo 40. 
Consider the case (ii): 


(2) = —1 ifand only if [p = 3(mod 8) or p = S(mod 8)]. 


Pp 
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5 : ‘ 
Also, (3) = —1 ifand only if (2) = —land 
(2) = —1 ifand only if |p = 2(mod 5) or p = 3(mod 5)]. 


Again using Chinese remainder theorem, we find that in this case p is congruent to 
3, 13, 27, or 37 modulo 40. This shows that 10 is a quadratic residue modulo p if 
and only if when p is divided by 40 the remainder is one of the 1, 3, 9, 13, 27, 31, 
37, or 39. 


Exercises 

8.3.1 Determine whether 150 is quadratic residue modulo 131. 
69 

8.3.2 Compute (3). 


8.3.3 Find the remainder when (60)*° is divided by 71. 


8.3.4 Determine whether the congruence equation 
40X? + 12X + 6 = 0(mod 23) 


has a solution. Determine the number of distinct solutions modulo 23, if it has any. 


8.3.5 Determine whether 221 is a quadratic residue modulo 4 x 7 x (13)3. Find 
the number of distinct solutions of 


X? = 221(mod 4 x T° x (13)°), 
if it has any. 
8.3.6 Determine whether 
12" + 7X + 3. =O0Gnod 511225) 


has a solution. How many distinct modulo 511225 solutions are there? 


8.3.7 Show that the number of distinct solutions of X? = a(mod p) is 1 + (5), 
where p does not divide a. 


8.3.8 Show that D/7)(4) = 0. 
8.3.9 Show that if (p,a) = 1, then 
bin (2) = 0. 


8.3.10 Let p be a prime dividing X* — X* + 1. Show that p = 1(mod 12). 
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8.3.11 Determine whether the parabola Y* = 641X + 3 intersects the lattice 
Zx i, 


8.3.12 Determine primes p for which 15 is a quadratic residue mod p. 


8.3.13 Suppose that a is a quadratic residue modulo all but finitely many primes. 
Show that a is a perfect square. 


8.3.14 Show that the ellipse X7 + 5Y? = p where p: is a prime intersects the 
lattice Z x Z if p = 1(mod 20) or p = 9(mod 20). 


8.3.15 Suppose that (=8) = 1. Show that the ellipse X? + 10¥? = p? intersects 
the lattice Z x Z. 


Chapter 9 
Structure Theory of Groups 


9.1 Group Actions, Permutation Representations 


Let G be a group with identity e and X a set. Amap*: Gx X —~> X is called an 
action of G on X if 


(i) x((e,x)) = x forall x in X, and 
(ii) *((gigo,x)) = *((g1, *((g2, x)))) for all gi, g2 in G and x in X. 


If we denote the image «((g, x)) by g x x, then the conditions (1) and (11) read as 


(i) exx = x forall x in X, and 
Gi) (g1g2)*x = gi *(g2*x) for all gj, g2 in G and x in X. 


We say that G acts on X through the action *. We also say that X is a G-set under 
the action x. 

Before having some interesting examples and applications of group actions, we 
shall develop the theory of group actions to some extent. 

Suppose that G acts on X through an action x. Let g € G. Define a map f, from 
X to X by fy(x) = g*x. Suppose that fj(x1) = fj(x2). Then, g* x, = g* x2. 
This implies thatx) = ex*x, = (g°'g) «x, = gl «*(g«x) = gui «(g%x) = 
(g-'g) «x2 = @xx2 = Xx2.Thus, fj is injective. Next, ify € X,then f,(g7! «y) = 
gx(g-'xy) = (gg7')*y = exy = y. Hence, f, is also surjective. This shows 
that f, € Sym(X). 

Define a map p from G to Sym(X) by p(g) = fy-Now, p(gi92)(*) = fog) = 
(gigz*xX = gir (go*x) = fg (fn) = (P(gep(g2))() for all 91, 92 € G 
and x € X. Hence, p(gigo) = (p(g1)ep(gz2)) for all 91, g2 € G. This shows that p 
is ahomomorphism from G to Sym(X). Such a homomorphism is called a permu- 
tation representation of G on X. Thus, given an action * of G on X, we have a 
representation of G on X. Conversely, given any representation p of G on X, we 
have an action * of G on X defined by g * x = p(g)(x) such that the corresponding 
representation is the same as p (verify). 
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Thus, there is a faithful way to view an action as a representation, and a represen- 
tation as an action, and they are related by 


g*x = p(g)(x). 


Let x be an action of G on X with the corresponding representation p. Then, the 
kernel of pis {g € G| p(g) = Ic} = {g€ G| p(g\(x) = x forall x € X} = 
{g€G|gx*«x = x forall x € X}. Therefore, {g ¢G|g«*«x = x forall x € X} 
is anormal subgroup of G. This subgroup is called isotropy group or the stabilizer 
the of the action « of G on X, and it is denoted by Stab(G, X). 

By the fundamental theorem of homomorphism, we have the following proposi- 
tion. 


Proposition 9.1.1 G/Stab(G, X) is isomorphic to the subgroup p(G) of Sym(X), 
where p is the corresponding representation. ft 


Proposition 9.1.2 Let * be an action of G on X and x € X. Then, G, ={g EG | 
g*x = x} isa subgroup of G. 


Proof Sinceexx = x,e € Gy. Leta,be G,. Then,axx = x = bDxx. But, 
thenx = exx = a! x(axx) = a! x(bxx) = (a~'b) «x. This shows that 
a'be G,. tt 


The subgroup Gy = {g€ G|gx*x = x} of Gis called the isotropy group at 
x or the stabilizer of x. This is also called the local isotropy group at x. Clearly, 
Srab(G, X). =" |yex Ge. 

Let G be a group which acts on X. Define a relation ~ on X as follows: 


x ~ yifand only ifgxx = yforsomeg eG. 


The relation ~ is reflexive, fore x x = x.Itis symmetric, for g «x = y implies that 
g |*y = x. Itis transitive, forg*x = yandh«xy = zimpliesthathgxx = z. 
Thus, ~ is an equivalence relation. 

The equivalence class of X modulo ~ determined by x is {ye X |x ~ y} = 
{g*x | g € G}. This set is denoted by G « x, and it is called the orbit of the action 
through x. 

From the properties of equivalence relations and equivalence classes (see Propo- 
sition 2.4.4), we have the following: 


Gy) x= Le am 
(ii) Distinct orbits are disjoint. 
(iii) G* x and G * y are same if and only ifg*x = y forsome g é€ G. 


The following proposition relates the orbit G « x through x and the isotropy group 
G, atx. 


Proposition 9.1.3 Let * be an action of G on X. Then, there is a bijective map from 
the set G/'G, of left cosets of G modulo G,, to the orbit G * x through x. 
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Proof Suppose that gG, = hG,. Then, g-'h € Gy. In turn, g''thxx = x. It 
follows that gxx = h*x. Thus, we have a map f from G/'G, to G*x defined 
by f(gG,) = gx. Clearly, this mapping is also surjective. Next, suppose that 
f(gGx) = f(hG,). Then, g*xx = hxx, or equivalently, g th € G,. Conse- 
quently, gG, = hG,. Thus, f is also an injective map. tt 


Corollary 9.1.4 G,. is of finite index in G if and only if G x x is finite. Further, then 
[G : G,] =| Gxx |. In particular, if G is finite, then [G : G,] =| Gx*x | forall 
x € X, and so | G xx | divides | G | forall x € X. Also, if X is finite, then isotropy 
subgroups at all points are of finite indexes, and|G : Gy] =| Gx*x |forallx € X.t 


Let G be a group which acts on X through an action x. An element x € X is 
called a fixed point if g* x = x for all g e G. Thus, x is a fixed point if and 
only if G, = G, or equivalently, G*x = {x}. The set of all fixed points of the 
action x is denoted by X%, and it is called the fixed point set of the action. Thus, 
X° = {xe X | gxx = x forall g€ G}. 

Anelement x € X isa fixed point if and only if the orbit G * x is singleton. Thus, 
X© is the union of all those orbits which are singletons. 

Let A be a set obtained by choosing one and only one member from each orbit 
different from singleton. Thus, ifx € A,thenG *x ¢ {x}. Further,G*xf|Gey = 
0 for allx,y € A,x # y. Since X is union of all orbits, 


x=x°U(U__, G+) (9.1.1) 


Suppose that X is finite. Then, since distinct orbits are disjoint, we have 
|X|=|X°|+2xca] Gx x | (9.1.2) 


Observe that each term under summation in the above equation is greater than 1. By 
Corollary 9.1.4, we have 


|X| =| X°| +2xealG : Gy] (9.1.3) 


Here again, each term under summation in R.H.S. is greater than | and divides | G | 
(Lagrange theorem). 
Equation (9.1.3) is called the class equation or class formula for the action x. 


Proposition 9.1.5 Let G be a finite group of order p",n > 1, where p is a prime. 
Let X be a finite set on which G acts. Then, 


| X | =| X° | (mod p). 
Proof Consider the class formula 


|X| =| X°| + Dele: G,] 
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of the action, where each term under summation in the right hand side is greater than 
1 and also divides | G | = p” (by the Lagrange theorem). This shows that the second 
term of the right hand side is a multiple of p. The result follows. tt 


Corollary 9.1.6 Let G be a finite group of order p",n > 1, where p is prime. Let 
X be a finite set containing m elements on which G acts. Suppose that p does not 
divide m. Then, X° # @. 


Proof From Proposition 9.1.5,m =| X |=| X© | (mod p). Since p does not divide 
m,| X° |#0. This means that X° # J. t 


So far, we developed elementary theory of group actions. Now, we give some 
examples and applications to the structure theory of finite groups. 


Example 9.1.7 Let G be a group and X the set part of G. Take « to be the binary 
operation of G. Then, G acts on G. This action is called the left multiplecation of 
G on G or left regular action of G on G. The isotropy group G, at x is given by 
G, = {g € G| gx = x} = {e} deft cancellation law). The stablizer Stab(G, G) 
of the left regular action of G on G is also {e}. The representation determined by 
left regular action of G on G is the homomorphism p of the Cayley’s theorem (see 
Theorem 6.2.21). Clearly, p is injective. The representation p is called the regular 
permutation representation of G. 


Example 9.1.8 Let G be a group and X = G. Define gxx = gxg™', g,x €G. 
It is easy to verify that * is an action of G on G. This action is called the inner 
conjugation of G on G. The isotropy group of the inner conjugation of G on G at 
xé€GisG, = {g€¢G|gxg! = x} = {g €G| gx = xg}. Recall (Definition 
4.4.15) that this subgroup of G is called the centralizer of x in G, and it is denoted 
by Cg(x). An element a € G is called a conjugate of an element b € G if there is 
an element g € G such that gag~! = b. Thus, the orbit through x € G of the inner 
conjugation of G on G is the setG*x = {gxg~! | g € G} of all conjugates of x in 
G. This is called the conjugacy class of G determined by x, and it is denoted by x. 


Corollary 9.1.4 applied to the inner conjugation of G on G gives the following: 


Corollary 9.1.9 Let G be a finite group. Then, the number of conjugates to an 
element x € G is equal to the index of the centralizer Cg (x) of x in G. In particular, 
the number of conjugates to an element of G is a divisor of the order of G. tt 


The stablizer Stab(G, G) of the inner conjugation of GonGis{g € G| gxg°! = 


x forallx €G} = {g€G|gx = xg forall x € G}. Recall (Definition 4.4.15) 
that this is the center Z(G) of G. The representation p determined by the inner 
conjugation of G on G is given by p(g) = f,, where f, is the inner automorphism 
of G determined by g. Thus, p(G) is the group /nn(G) of all inner automorphisms 
of G. By the fundamental theorem of homomorphism, we have the following: 


Theorem 9.1.10 G/Z(G) © Inn(G). tt 
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An element x € G is a fixed point of the inner conjugation of G on G if and only 
if gxg-' = x forall g € G, or equivalently, xg = gx for all g € G. Thus, x € G 
is a fixed point of the inner conjugation of G on G if and only if x € Z(G). This 
shows that the fixed point set G° of the inner conjugation of G on G is the center 
Z(G) of G. Applying the Class equation (9.1.3) for the inner conjugation, we get 


the following: 


Theorem 9.1.11 /f G is finite, then 
|G|=| Z(G) | + YrealG : Ce(x)] (9.1.4) 


where A is a set obtained by choosing one and only one member from each conjugacy 
class which is not singleton. Each term under summation in the R.H.S. of (9.1.4) is 
greater than I, and also divides | G |. tt 


Equation (9.1.4) is called the classical class equation of G. 


Example 9.1.12 Let G be a group and X the set $(G) of all subgroups of G. Define 
an action * of Gon X byg* H = gHg™' (note that gHg7! is a subgroup of G and « 
is indeed an action). This action is called the inner conjugation of G on subgroups 
of G. 


Let H € S(G). Then, the Isotropy group Gy of the inner conjugation action at 
H is given by Gy = {9g €G|gHg! = H} = {g€G|gH = Hg}. Recall 
(Definition 4.4.15) that this subgroup of G is called the Normalizer of H in G, and it 
is denoted by Ng(H).SincehH = H = Hh forallh ¢ H, H C Ng(A). Further, 
if K is a subgroup of G such that H < K, then gH = Hg forall g € K, and so 
K C NG(A). Thus, we have the following: 


Proposition 9.1.13 The normalizer Ng(H) of H in G is the largest subgroup of G 
in which H is normal. tt 


A subgroup H of a group G is called a conjugate to a subgroup K of G if there 
is anelement g € G suchthat K = gHg~'. Thus, the orbit G « H at H of the inner 
conjugation of G on subgroups of G is the set Gx H = {gHg !|g€ G} of all 
conjugates of H in G. Proposition 9.1.3 applied to the inner conjugation of G on the 
set S(G) of all subgroups of G gives the following corollary. 


Corollary 9.1.14 There is a bijection from the set G/'Ng(H) of left cosets of G 
modulo Ng(f) to the set of all conjugates of H. If G is finite, then the number of 
conjugates to H is equal to[G : Ng(A)). The number of conjugates to H is a divisor 
of the order of G. tt 


Anelement H € S(G) will be a fixed point of the inner conjugation of G on S(G) 
if and only if gHg~! = H forall g € G, or equivalently, gH = Hg forallg € G. 
Thus, A is a fixed point if and only if H <I G, and so the fixed point set (S(G))° is 
the set of all normal subgroups of G. The class formula for the inner conjugation of 
G on S(G) becomes 
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| SG) | =| (S(G))° | + ZHealG : Nc(A)] (9.1.5) 


where each term under summation in the R.H.S. is greater than | and A is the 
set obtained by choosing one and only one member from each conjugacy class of 
subgroups which is not singleton. 


Example 9.1.15 Let G be a group and H a subgroup of G. Suppose that x; H = 
x2H. Then, (gx1)~!(gx2) = oe. € H. Hence, gx;H = gx2H. Thus, we have 
an action * of G on G/'H defined by g*xH = gxH. This action is called the left 
multiplication of G on G/'H. The isotropy group G, 4 of the action of G on G/'!H 
at xH is given by Gxy = {9€G|gxH = xH} = {g€G|x'gx eH} = 
{g€G|gexHx-'} = xHx7!. The orbitG*xH = {gxH|ge¢G} = G/'H. 
The stablizer Stab(G, G/'H) = Clee Gee = (hee xHx7~!. This subgroup of 
G is called the core of H in G, and it is denoted by Coreg(A). 


Proposition 9.1.16 Coreg(H) of H in G is the largest normal subgroup of G 
contained in H. 


Proof Clearly, Coreg(H) = (xg x Hx! is a normal subgroup of G. If K is a 
normal subgroup of G contained in H, then K = xKx~'! CxHx7! forall x € G. 
This shows that K C Coreg(A). tt 


Since the representation p determined by the left multiplication action of G on 
G/'H is ahomomorphism whose kernel is Coreg (H), we have the following: 


Proposition 9.1.17 G/Coreg(H) is isomorphic to a subgroup of Sym(G/'H). ¢ 


Example 9.1.18 This example is from the dynamics of projectile. (The acceleration 
g due to gravity is assumed to be constant). A projectile is completely determined 
by a point on the path of the projectile at a particular instant and the velocity of 
the particle at that point. Consider R° whose first three coordinates determine the 
position of a particle in the space, and the last three components give the components 
of the velocity of the particle along x, y, and z axes (z axis is vertical), respectively, 
at that instant. Consider the additive group R of reals. We have the following action 
x of R on R° defined by 


tx (a, 8, U1, U2, U3) = (Ut + a, Ut + B, ust — 4g? +7, U1, U2, U3 — gt), 


where ¢ represents time parameter. It is easy to check that * is an action. 


Example 9.1.19 Consider the general linear group GL(n, R). If we treat the mem- 
bers of R” as column vectors, then the matrix multiplication from left defines an 
action of GL(n, R) on R”. There are only two orbits of this action, viz., {0} and 
R’ - {0}. Further, all subgroups of GL(n, R) act on R”. In particular, Consider the 
action of the special orthogonal group SO (3) on the Euclidean 3-space R?. Let @ 
denote the unit vector [1, 0, 0]. Then, SO (3) - @" is the unit sphere S? with center 
origin. Indeed, for any A € SO(3), 
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(A-@')'- A-e@ = @1-e]. = 1. 


This shows that SO(3) -@' € S?. Also given any unit column vector X ins 2 let Y 
and Z be a pair of unit vectors such that {X, Y, Z} is a set of pairwise orthogonal unit 
vectors. Let A = [xX ‘ Y’, 71. Then, A € O(3). Interchanging the second and the 
third column, if necessary, we may assume that A € SO(3). Clearly A-@)' = x 
This shows that S? C SO(3) - @’. Thus, S? is an orbit of the action passing through 
the unit vector 2". 

Similarly, the sphere S*(r) with center origin and radius r, r > 0 is also an orbit. 
Indeed, S*(r) = SO(3)- rez’. {0} is the trivial orbit. Evidently, these are all the 
orbits of the action. 

Now, the isotropy subgroup SO(3)z = {A € SO(3)|A-@' = @%"} caneasily 
be seen to be the subgroup of SO(3) consisting of the matrices of the type 


1 O 0 
0 cosO +sin®@ |, 
0 ssiné cosé 


where 0 < @ < z. This subgroup is clearly isomorphic to SO(2). Without any loss, 
we may denote it by SO(2). Thus, we have the bijective map 7 from S'O (3)/"SO(2) 
to S? given by (A - SO(2)) = A-@'. This map is also a topological homeomor- 
phism. 


Now, we apply the theory of group actions to the structure theory of finite groups. 


Proposition 9.1.20 Let G be a group of order p",n > 1. Let H be a nontrivial 
normal subgroup of G. Then, H (\ Z(G) # {e}. In particular, Z(G) # {e}. 


Proof Since H <G,G acts on H through inner conjugation. The fixed point 
sett H° = {he H|ghg"' = h forall g€¢G} = {heH|he Z(G} = 
H () Z(G). Consider the class formula 


p’ =|H|=| H°| + SrealG: Gx] 


Since H # {e}, r > 1. Thus, L.HS. is divisible by p and the second term in 
the R.H.S. is also a multiple of p. Hence, p divides | H® |. Since ee H° = 
H(\ Z(G), H{)\ Z(G) contains at least p elements. Taking H = G, we observe 
that the Z(G) # {e}. tt 


Corollary 9.1.21 Let G is a group of order p*, where p is a prime. Then, G is 
abelian. 


Proof Consider Z(G). By the above proposition, | Z(G) | = p or p?. If | Z(G) | 
= p’,then Z(G) = G, and so G is abelian. We show that | Z(G) |4 p. Suppose 
that | Z(G) | = p, then | G/Z(G) | = p, and so G/Z(G) is cyclic. By Theorem 
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5.2.32, it follows that G is abelian. But, then Z(G) = G, and so| Z(G) | = p. 
This is a contradiction to the supposition. tt 


The following theorem classifies all groups of order p”, where p is a prime. 


Theorem 9.1.22 There are only two isomorphism classes of groups of order p’, 
where p is a prime. One is the cyclic group of order p?, and the other is direct 
product of two cyclic groups of order p. 


Proof If G contains an element of order p’, then it is cyclic. Suppose that it contains 
no element of order p?. Then, order of each nontrivial element of G is p. Let a be 
an element of G of order p. Then, |< a >| = p, and so there exists an element 
b ¢<a >. Clearly, b is also of order p, and < a >4#< b >. Since any nonidentity 
element of a cyclic group of order p is a generator of the group, < a > ()<b>= 
{e}. Hence, G = < a >< b >. Also, since G is abelian (by the above corollary), by 
Proposition 5.2.4, it is internal direct product of < a > and < b >. By Proposition 
5.2.3, G is isomorphic to the external direct product <<a > x <b>. ft 


Proposition 9.1.23 Let G be a group of order p",n > 1, where p is a prime. Then, 
to every divisor of p", there is a subgroup, in fact, anormal subgroup of that order. 


Proof The proof is by induction on n. Ifn = 1, then G is prime cyclic, and there is 
nothing to do. Assume that the result is true for a group of order p”. Let G be a group 
of order p"”*t!. By Proposition 9.1.20, Z(G) # {e}. Let ae Z(G), a & e. Suppose 
that | a|= p’,s > 1. Clearly, a is of order p orelse a” is of order p. Thus, there 
is an element b € Z(G) of order p. Take H =< b>. Then, H is a subgroup of 
G of order p. Since H is contained in Z(G), it is normal in G. Consider the group 

= G/H. Clearly, | G/H |= p”. By the induction hypothesis, corresponding to 
every divisor p',0 < t <n, there is a normal subgroup L of G/H of order p’. By 
Proposition 5.2.29, there is a unique normal subgroup K of G containing H such 
that L = K/H.Since| L |= p', by the Lagrange theorem, | K | = p’*!. ft 


The following proposition is a generalization of the result: “Every subgroup of 
index 2 is normal’. 


Proposition 9.1.24 Let G be a finite group, and p the smallest prime dividing the 
order of G. Then, every subgroup of G of index p is normal in G. 


Proof Let H be a subgroup of G of index p, where p is the smallest prime dividing 
order of G. Then, G/'H contains p elements. Consider the action « of G on G/'H 
defined by gx xH = gxH. The stablizer of this action is the Coreg(H) which is 
the largest normal subgroup of G contained in H. Also, G/ Coreg (fH) is isomorphic 
to a subgroup of Sym(G/!'H) ~ S,. Since p is the smallest prime dividing the 
order of G, (| G/Coreg(H) |,| Sp |) = 1 or p. If | G/Coreg(H) | = 1, then 
G = Coreg(A) © H, acontradiction to the supposition that[G : H] = p. Hence, 
[G : Coreg(H)] = [G: H]. Since Coreg(H) C H,H = Coreg(H)<iG. ¢ 
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Exercises 


9.1.1 Let G be a group which acts on X through the action *. Show that Gy, = 
gGxg_' forallg € Gandx € X.Deduce that {G,,. | g € G}isacomplete conjugacy 
class of subgroups. 


9.1.2 Let G be a group which acts on X. We also say that X is a G-set. A subset 
Y of X is called a G-subset if the action of G on X induces an action on Y, ie., 
g*y €Y forall g € Gandy € Y. Show that a subset of X is a G-subset if and only 
if it is a union of some of the orbits. 


9.1.3, Let G be a group which acts on X and also on Y. A map f from X to Y is 
called a G-equivariant map if f(g*x) = g* f(x) forall g € Gandx € X. Let f 
from X to Y be a G-equivariant map. Show that G, € Gy) forall x € X. 


9.1.4 Let G be a group and H, K be subgroups. Then, G acts on G/' H, and it also 
on G/'K through left multiplication. Show that there is a G-equivariant map from 
G/'H to G/'K if and only if there exists an element a € G such thataHa™! C K, 
and then f, : G/'H —>+ G/'K defined by f,(xH) = xa7'K is an equivariant 
map. 

Hint. Suppose that f(H) = a7!K. 


9.1.5 Call a bijective G equivariant map to be a G-equivalence. Show that G/! H 
is G equivalent to G/'K if and only if H and K are conjugates. 


9.1.6 Call an action * of G on X to be a transitive action if there is only one orbit, 
Le., given any x, y € X, there is an element g € G such that g*x = y. Suppose 
that G acts transitively on X. Let x € X. Show that the map f from G/’G, to X 
defined by f(gG,) = g*x is a G equivalence. 


9.1.7 Find the number of S4 equivalence classes of transitive S4 actions. 
9.1.8 Show that every G-set is disjoint union of transitive G-sets. 


9.1.9 Describe the action of the additive group R of real numbers on a suitable 
Euclidean space which describes a motion under a central force of attraction follow- 
ing the inverse square law. 


9.1.10 Describe all the equivalence classes of actions of Qs on a set containing 3 
elements. How many of them are there? 


9.1.11 (Cauchy—Frobenius). Suppose that G acts on a finite set X. Show that the 
number of orbits is 4, Djcg | {x € X | gxx = x} |. Deduce that if G acts transi- 
IG] “9 

tively, then Xgeg | {x EX | gxx = x}|=|GI. 

Hint: Consider the set Q = {(g,x)©€GxX|gxx = x}. Let {G*x1,Gx 
X2,...,G*x;} be the set of distinct orbits of the action. Then, | 22 |= Ygeg | {x € 
X|g*x = x} |. On the other hand, | Q| = Di) DreGax, | Gx | = UL, | GI. 
Equate the two. 
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9.1.12 Let G be a finite group and {a, a2, ...,a,} be a set obtained by choosing 
one and only one member from each conjugacy class of G. Then, show that 


1 1 
|Ce(ai)| Ls |\Ce(a)| + + 


1 = 
ICa(ar)| 
Deduce that there are only finitely many finite groups (upto isomorphism) which has 
a fixed number of conjugacy classes (the number of conjugacy classes of a group G 
is called the class number). Describe groups having class numbers 1, 2 and 3. 


9.1.13 Let H be a maximal subgroup of G. Suppose that H is not normal in G. 
Show that there are [G : H] conjugates to H. 


9.1.14 Let H be a proper subgroup of G. Show that G 4 eee gHg'. 


9.1.15 Let G be a group which has a proper subgroup of finite index. Show that G 
has a proper normal subgroup of finite index. Deduce that an infinite group having a 
subgroup of finite index cannot be simple. 


9.1.16 Show that Cg(H) < Nc(H). 


9.1.17 Let H be a subgroup of finite index of a finitely generated group G. Show 
that H is also finitely generated. 
Hint. Let S be a right transversal to H in G. Then, S is finite. Let Y be a finite set of 
generators of G. For each y € Y, and for each x € S, let a(x, y) be the element of 
HT determined by the equation xy = o(x, y)z, where z € S. Show that {o(x, y) | 
x € Sand y € Y} generates H. 


9.1.18 Let H be a subgroup of finite index in G. Show that Coreg(H) is a normal 
subgroup of finite index in G. 


9.1.19 A transitive action of a group G on X is said to be regular if G, = {e} for 
all x € X. Show that if G is regular, then | X | = | G |, and it is equivalent to the left 
multiplication on G. 


9.1.20 Show that every transitive faithful action of an abelian group is regular. 


9.1.21 Show that () 7. sic) Nc(H) 3G, and the corresponding quotient group is 
isomorphic to a subgroup of Sym(S(G)). 


9.1.22 Let G be a group of order p",n > 1, where p is a prime. Let r denote the 
number of subgroups of G and s the number of normal subgroups of G. Show that 
p divides r — s. 


9.1.23 Call an action of G on X to be doubly transitive if given x; 4 x2 and y; A yo 
in X, there isa g € Gsuchthatg*x,; = y, andg*x2 = yp. Suppose that a finite 
group G acts doubly transitively on X. Show that Xyeg | {x € X | gxx = x} P= 
2|G\. 
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9.1.24 Let G bea group which acts transitively on X. Show that G acts doubly tran- 
sitively if and only if for each x € X, the isotropy subgroup G, at x acts transitively 
on X — {x}. 


9.1.25 Let G be a finite group which acts doubly transitively on a set X containing 
n elements. Show that n(n — 1) divides the order of G. 


9.1.26 Call an action of G on X to be sharply doubly transitive if given x; # 
Xy and y; A Y2 in X, there is a unique g € G such thatg* x, = y, andgx*xx. = 
y2. Suppose that a finite group G acts doubly transitively on a set X containing n 
elements. Show that G acts sharply doubly transitively if and only if G is of order 
n(n — 1). 


9.1.27* Let G be a group which acts sharply doubly transitively on a set X. Show 
that G contains several elements of order 2 (called involutions), and all elements of 
order 2 form a complete single conjugacy class. Further, show that either all isotropy 
subgroups G, contain a unique element of order 2, or no isotropy subgroup contain 
an element of order 2. 


9.1.28 Let F be any field. Define multiplication: on G = F x F* by 


(x, a)-(y, b) = @& + ay, ab) 


Show that G is a group with respect to this multiplication. Define a map « from 


Gx FtoFb 
7 (x, a)xy =x + ay 


Show that « is an action which is sharply doubly transitive action on F. 


9.2 Sylow Theorems 


Structure of a finite abelian group is well understood. Indeed, finite abelian groups 
are completely classfied (see Sect.9.3). However, understanding the structure of a 
finite nonabelian group is extremely difficult problem. Perhaps, it is beyond dream to 
classify nonabelian finite groups. Mathematicians always roam around this problem. 
In the last section, we obtained some structural information about finite groups of 
prime power orders. In this section, we study finite groups by analyzing prime power 
order subgroups of the group. Basic results in this direction are Sylow theorems. The 
following is the Sylow Ist theorem. 


Theorem 9.2.1 (Sylow) Let G be a finite group and p" divides | G |, where p is 
prime. Then, G has a subgroup of order p’. 


Proof The proof is by induction on | G |. If| G | = 1, then there is nothing to do. 
Assume that the result is true for all those groups whose orders are less than | G |. 
Then, we have to prove the result for G. Suppose that p’, r > 1 divides the order of 
G. Consider the class formula 
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|G|=| Z(G) | + XrealG : Ce(x)] 


for the inner conjugation action of G on itself. The terms under summation in the 
R.HLS. are greater than 1, and they also divide | G |. There are two cases: 

(i) p divides | Z(G) |. 

(ii) p does not divide | Z(G) |. 

Consider the case (i). Since Z(G) is abelian, and p divides | Z(G) |, by the 
Cauchy theorem for abelian groups (Theorem 5.2.31), there exists a subgroup H 
of Z(G) of order p. Since every subgroup of Z(G) is normal in G, H <1 G. Now, 
consider G/#H. Clearly, | G/H | = a = - <| G |. Also p’~! divides | G/H |. 
By the induction hypothesis, G/H has a subgroup L of order p’~'. By Proposition 
5.2.29, there is a unique subgroup K of G containing H suchthat L = K/#H. Since 
| L |= p’—', by the Lagrange theorem, | K | = p”. But, then K is a subgroup of 
G of order p’. 

Now, consider the case (ii). Since p does not divide | Z(G) |, and p divides | G |, 
at least one term under summation in the R.H.S. of the classical class formula is 
not divisible by p. Suppose that p does not divide [G : Cg(x)], where [G : Cg(x)] 
is greater than 1. Thus, p does not divide ee Since p’ divides | G |, it follows 
that p’ divides | Cg(x) |. Further, since [G : Cg(x)] > 1,| Cg(x) | <| G |. By the 
induction hypothesis, Cg(x) has a subgroup of order p’, and so G has a subgroup 
of order p’. tt 


Corollary 9.2.2. Let G be a finite group, and a prime p divides the order of G. Then, 
G contains an element of order p. 


Proof From the above theorem, G has a subgroup H of order p. Any nonidentity 
element of H is of order p. tt 


Definition 9.2.3 A group G is said to be a p-group if order of each element of G is 
a power of p. 


Corollary 9.2.4 A finite group G is a p-group if and only if | G | = p" for some n. 


Proof If order of G is p”, then since order of each element of G divides order of G, 
order of each element of G is a power of p. Conversely, if order of each element of 
G is a power of p, then no other prime g can divide the order of G, for otherwise, 
by the above corollary, G will have an element of order q. tt 


Definition 9.2.5 A maximal p-subgroup of G is called a Sylow p-subgroup of G. 


Remark 9.2.6 Every group has a Sylow p-subgroup (may be {e}) for every prime p. 
This is an easy consequence of Zorn’s Lemma if we observe that union of a chain of 
p-subgroups is a p-subgroup. For finite groups, we have the following corollary: 


Corollary 9.2.7 Let G be a finite group. Then, G has a Sylow p-subgroup which is 
of order p", where p™ divides | G | but p'"*! does not divide | G |. 
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Proof Since G is finite, there exists m such that p” divides | G | but pt! does not 
divide | G |. From the Sylow Ist theorem, there is a subgroup P (say) of G of order 
p™. Since there is no higher power of p dividing | G |, P is a Sylow p-subgroup 
of G. o 


Example 9.2.8 Consider S3. The subgroup A3 is the only Sylow 3-subgroup of $3. 
The subgroups {/, (12)}, {7, (23)} and {7, (13)} are all Sylow 2-subgroups. For the 
rest of the primes, {7} is the only Sylow subgroup. 


Example 9.2.9 Consider the group A4. V4 is the only maximal 2-subgroup of Aq, and 
hence, this is the only Sylow 2-subgroup of A4. Further, {7, (123), (132)}, {7, (124), 
(142)}, {7, (134), 143)}, and {7, (234), (243)} are all Sylow 3-subgroups of Aa. 
There is no other prime dividing the order of A. 


Example 9.2.10 Consider the group S4 which is of order 24 = 23 x 3. There are 4 
subgroups of order 3 which are all Sylow 3-subgroups. They are {/, (123), (132)}, 
{T, (124), (142)}, {7, 134), (143)}, and {J, (234), (243)}.. For the Sylow 
2-subgroups, consider the Klein’s four subgroup V4 which is a normal subgroup 
of Sy. Let H = {I, (12)}. Then, H V4 is a subgroup of order 8. Thus, it is a Sylow 
2-subgroup of Sy. If we take K = {7, (13)},andZ = {I, (23)}, then K V4 and LV4 
are also Sylow 2-subgroups of S4. They are all. 


Proposition 9.2.11 Let P be a Sylow p-subgroup of G. Then, any conjugate gP g7' 


of P is also a Sylow p-subgroup of G. 


Proof Let P beaSylow p-subgroup of G. Since the subgroup g Pg! is isomorphic to 


P, itis p-subgroup. Further, if P’ is also a p-subgroup of G such that gPg7! C P’, 
then P C g"!P’g. Since g~! P’g is also a p-subgroup, and P is a Sylow (and so 
maximal) p-subgroup, it follows that P = g~!P’g. In turn, gPg-! = P’. This 
shows that gPg~! is a maximal p-subgroup, and so it is a Sylow p-subgroup. tt 


Corollary 9.2.12 If P is a unique Sylow p-subgroup of G, then it is normal. tt 


Corollary 9.2.13. The intersection of all Sylow p-subgroups of G is normal in G. 


Proof Let g denote the set of all Sylow p-subgroups of G. Then, IN Pep I = 


rep 9P9' = Mpeg P, for {gPg™' | P € 9} = g. This shows that (|p... isa 
normal subgroup of G. tt 


Proposition 9.2.14 Every p-subgroup of a finite group is contained in a Sylow p- 
subgroup. 


Proof Let H be a p-subgroup of a finite group G. If H is a maximal p-subgroup, 
then it is a Sylow p-subgroup. If not, then there is a p-subgroup H; containing H 
properly. H; may be a Sylow p-subgroup. If not, proceed. Since G is finite, the 
process stops after finitely many steps giving us a Sylow p-subgroup of G. tt 


Remark 9.2.15. The result of the above proposition is true even for infinite groups. 
This follows from an easy application of Zorn’s Lemma. 
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Proposition 9.2.16 Let P be a Sylow p-subgroup of a finite group G, and H a 
p-subgroup such that HP = PH. Then, H C P. 

Proof Since HP = PH, H P isasubgroup. Also| HP |= Vad is a power of p. 
Thus, HP isa p-subgroup of G. Clearly, P C HP. Since P is a Sylow p-subgroup, 
P = HP.Hence,H CHP = P. 


Corollary 9.2.17 Let P, and P, be Sylow p-subgroups of a finite group G such that 
P,P = PP. Then, P, = Py. 


Proof This is immediate from the above proposition. tt 


Corollary 9.2.18 Let P; and Pz be Sylow p-subgroups of a finite group G such that 
gP\g"! C P, for all g € P3. Then, P;} = Py. 


Proof If gPig"! © Pi, thengP\g-'! = P,, for gP\g~ is also a Sylow p-subgroup. 
Thus, gP,g-! = P, forall g € Pj, andso gP; = Pig for all g € P:. This means 
that P; Py) = PP), and hence, from the above corollary, P} = P. tt 


Corollary 9.2.19 Let P be a Sylow p-subgroup of a finite group G. Then, Ng(P) 
contains a unique Sylow p-subgroup of G. ft 


Following is the Sylow 2nd theorem. 


Theorem 9.2.20 (Sylow 2) Let G be a finite group and a prime p divides the order 
of G. Then, the set of all Sylow p-subgroups of G form a single complete conjugacy 
class of subgroups (i.e., conjugate of a Sylow p-subgroup is a Sylow p-subgroup, 
and any two Sylow p-subgroups of G are conjugate). Further, the number m of Sylow 
p-subgroups of G is of the form 1+kp, k = 0 (i.e.,m = 1(mod p)). 


Proof Let P; be a Sylow p-subgroup of G, and X = {P}, Po,..., Pm} be the set 
of all conjugates of P;. Then, we need to show that all Sylow p-subgroups of G are 
in X, and m = 1|(mod p). Since X is complete conjugacy class of subgroups of G, 
the group G, and so also P; act on X through inner conjugation. Consider the class 
formula 

m =|X|=|X" | + Leal Pi: (Pi)p)] (9.2.1) 


for the inner conjugation of P; on X, where X”' denotes the fixed point set of the 
action, and A is a set obtained by choosing one and only one member from each 


nonsingleton orbit. Suppose that P, € X”. Then, gP;g-! = P, forall g € P;. But, 
then P; P; = P,P, and hence, P; = P; (Corollary 9.2.17). Thus, xX? = {Pi}, 
and so | X” | = 1. Since each term under summation in the R.H.S. of the class 


equation is greater than 1, and also divides | P, |, it follows that the second term in 
the R.H.S. of (9.2.1) is a multiple of p. Thus, m = 1(mod p). It remains to show 
that each Sylow p-subgroup of G is a member of X. Suppose, if possible, that there 
is a Sylow p-subgroup P’ of G which is not in X. Then, P’ also acts on X through 


inner conjugation. Suppose P; € X”. Then, gP;g~! = P, for all g € P’, and so 
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P’P; = P,P’. Again from Corollary 9.2.17, P; = P’. Since P’ ¢ X, xX? = G, 


andso| X? |= 0. Looking at the class formula for this action, we observe that m = 
O(mod p). This is a contradiction to the already observed fact that m = 1(mod p). 
Thus, all Sylow p-subgroups of G are in X. tt 


Corollary 9.2.21 /f p™ divides | G |, and p'*' does not divide | G |, then all Sylow 
p-subgroups of G are of order p™. 


Proof From the above theorem, all Sylow p-subgroups are conjugate. Also all con- 
jugate subgroups have same orders, and there is a Sylow p-subgroup of order p” 
(Sylow Ist theorem). The result follows. t 


Since the number of conjugates to a subgroup is equal to the index of the normalizer 
of that subgroup in the group, the following corollary follows from the Lagrange 
theorem. 


Corollary 9.2.22 Thenumberm = 1 + kp of Sylow p-subgroups of G is a divisor 
of | G |. t 


Following corollary gives a sufficient condition for a Sylow subgroup to be normal. 


Corollary 9.2.23 Let p be a prime divisor of | G |. Suppose that 1 + kp divides 
| G | only ifk = 0. Then, there is a unique Sylow p-subgroup of G which is normal. 


Proof From the above corollary, there is a unique Sylow p-subgroup of G, and since 
conjugate to a Sylow p-subgroup is again a Sylow p-subgroup, it follows that the 
Sylow p-subgroup is normal. tt 


Corollary 9.2.24 Let H be a normal p-subgroup of a finite group G. Then, H is 
contained in each Sylow p-subgroup of G (i.e., Nee P is the largest p-subgroup 
of G which is normal in G). 


Proof Let H be a normal p-subgroup of G. Then, by Proposition 9.2.14, H is 
contained in a Sylow p-subgroup P (say) of G. Let P’ be any Sylow p-subgroup 
of G. Then, by Theorem 9.2.20, P’ = gPg™! for some g € G. Thus, gHg"! C 
gPg | = P’. Since H is supposed to be normal in G, gHg~! = H, and hence, 
HCP’. tt 


Proposition 9.2.25 A finite group G is direct product of its Sylow subgroups if and 
only if all Sylow subgroups are normal. 


Proof \f G is direct product of all its Sylow subgroups, then by Corollary 5.2.25, 
all Sylow subgroups of G are normal. Conversely, suppose that all Sylow subgroups 
of G are normal. Suppose that | G | = p}'p5’...- p®’, where pj, po,..., py are 
distinct primes and a; > 0. Let P; be the Sylow p; subgroup of G (since each 
Sylow p; subgroup is normal, they are unique). Then, by the hypothesis, each P; < 


G. Since products of normal subgroups are normal P; Po... P)—1 Pi41Pi+2...P, 
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is normal in G for each i. Next, since P;} () Px = {e},| PiP2|=|Pi || |= 
P\' + p>’. Proceeding inductively, we find that | P; || Po |... | P: |= py" ps... py" 
for all t. Thus, | G | = | P; || Po |...| P, |, and hence, G = P,P,...P,. Also 
P; (\(Pi Pp... Pi-1 Pi41 Pi42-..P-) = {e}. It follows from Corollary 5.2.25 that G 
is direct product of all its Sylow subgroups. tt 


nz 


Corollary 9.2.26 Let G be a group of order p}' ps’... pr", where pi, p2,.-- Pr 
are distinct primes, and n; € {1,2}. Then, G is abelian if and only if all its Sylow 
subgroups are normal. 


Proof If G is abelian, then all its subgroups, and in particular all its Sylow subgroups 
are normal. Further, if all Sylow subgroups of G are normal, men G is direct product 
its Sylow subgroups P|, P2,..., P,. Each P;, being of order Pp; , nj; < 2, is abelian. 


Since direct product of abelian groups are abelian, G is abelian. tt 


Since direct product of cyclic groups of co-prime orders are cyclic, we have the 
following corollary. 


Corollary 9.2.27 Let G be a group of order p\ po... Pr, where Pp, P2,---, Pr are 
distinct primes. Then, G is cyclic if and only if all its Sylow subgroups are normal. t 


Corollary 9.2.28 Let p\, p2,..., Pr be distinct primes. Then, there are exactly 
2' nonisomorphic abelian groups of order p}' ps’... pt’, ni € {1,2}, where t = 
X/_yn; — r. In particular, there is only one (upto isomorphism) abelian group of 
order p\ p2... py which is cyclic. 


Proof Let G be an abelian group of order p/'' p;’ ... p”. Then, G is isomorphic to 
the external direct product of its Sylow schoroups B, Ps, ..., P, which are of orders 
Pi's Po’, --+5 ph", respectively. Ifn; = 1, then P; is mecessadly cyclic of order p;. 
If n; = 2, then P; has 2 possibilities, viz. Z ee and Zp, x Zp,. In turn, there are 
exactly 2' possibilities for G, wheret = Xj_,n; — r. In particular, if G is abelian 
group of order p; p2... p;, then it is isomorphic to direct product of distinct prime 
cyclic groups, and so it is cyclic. tt 


Let G be a finite group, and H a subgroup of G. Let P be a Sylow p-subgroup 
of G. Then, H (] P need not be a Sylow p-subgroup of H. For example, P = 
{I, (234), (243)} is a Sylow 3-subgroup of Sy, and $3 = {J, (12), (23), (13), (123), 
(132)} is a subgroup of S4 whereas $3() P = {J} is not a Sylow 3-subgroup of $3. 
However, we have the following: 


Proposition 9.2.29 Let H be a subgroup of a finite group G, and P a Sylow p- 
subgroup of G such that H P is a subgroup of G (in particular H may be normal 
or P may be normal in G). Then, H (\ P is a Sylow p-subgroup of H. Further, if 
HAG, then HP/H isa Sylow p-subgroup of G/H. 


|H| __ JH P| 


Proof Since H () P isasubgroup of P, itisa p-subgroup of H. Also a P= WP 
Since HP is a subgroup of G,[G: P] = [G: HP]-[HP.: P]. Further, since P 
is a Sylow p-subgroup of G, [G: P] is co pmme 7 p, and so [HP: P] is also 
co-prime to p. This means that p does not divide Hence, H () P is a Sylow 
p-subgroup of H. 


Tanta 
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Next, if H is a normal subgroup of G, then HP/H ~ P/(H () P) (Noether 
IG 
|G/H| [HI |G| 


isomorphism theorem) is a p-subgroup of G/H. Also lHP/H] — WT = [HP is 
TAT 
co-prime to p. This shows that H P/H is a Sylow p-subgroup of G/H. tt 


Under a homomorphism, image of a Sylow p-subgroup need not be a Sylow p- 
subgroup, though it is a p-subgroup. For example, the trivial homomorphism from 
S4 to Aq does not take any Sylow 3-subgroup to a Sylow 3-subgroup. However, from 
the fundamental theorem of homomorphism, and the above proposition, we get 


Proposition 9.2.30 Under a_ surjective homomorphism, image of a Sylow 
p-subgroup is a Sylow p-subgroup. tt 


Inverse image of a Sylow p-subgroup under a homomorphism need not even 
be a p-subgroup. For example, the first projection from A4 x Zz to Ag is a sur- 
jective homomorphism, and the inverse image of the Sylow 3-subgroup A3 = 
{Z, (123), (132)} is A3 x Z2 which is not a 3-subgroup. However, we have the fol- 
lowing: 


Proposition 9.2.31 Let G, be a finite group, and f a surjective homomorphism from 
G\ to G». Let P be a Sylow p-subgroup of G2. Then, f~'(P) is a Sylow p-subgroup 
of G, if and only if the kernel of f is a p-subgroup of G. 


Proof By the fundamental theorem of homomorphism, f~!(P)/ker f ~ P. Thus, 
f—'(P) is a p-subgroup if and only if ker f is a p-subgroup. Further, then, [G, : 
f-'\(P)] = (G,/ker f : f~'(P)/ker f] = [G2 : P]isco-prime to p. This shows 
that f~!(P) is a Sylow p-subgroup of Gj. tt 


Applications of Sylow Theorems 


9.2.1. Let G be group of order pq, where p and q are primes. Suppose that p = q. 
Then, | G | = p’, and so G is abelian. There are only two possibilities for G. It 
is isomorphic to Z,2 or to Z, x Zp. Next, suppose that p > q. Then, 1 + kp will 
divide pq only whenk = 0. Thus, there is a unique Sylow p-subgroup P of order p, 
which, therefore, is normal in G. In particular, a group of order pg cannot be simple. 
Further, suppose that g does not divide p — 1. Then, | + kq also cannot divide pq, 
unless it is 1. But, then, Sylow q-subgroup Q is also normal in G. From Corollary 
9.2.27, it follows that G is cyclic. Thus, for example, every group of order 15 is 
cyclic. Let G be a group of order pgr, where p,q and r are distinct primes such 
that 1 + kp divides pqr only ifk; = 0, 1+ k2q divides pqr only if ky = O and 
1+ k3r divides pgr only if k3 = 0. Then, all Sylow subgroups of G are normal, 
and again by Corollary 9.2.27, G is cyclic (for example, every group of order 1001 
is cyclic). A complete classification of groups of order pq will follow in the last 
illustration of this section. 

9.2.2. Every group of order 57-77 = 1225 is abelian, and there are four iso- 
morphism classes of groups of order 1225: Indeed, 1 + 5k divides 5* - 7* only when 
k = 0,1+71 divides 5*-7? only when/ = 0. Thus, there is a unique Sylow 
5-subgroup P (say) which is of order 57, and also a unique Sylow 7-subgroup Q 
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(say). In turn, P and Q are normal subgroups of G. But, then, G is direct product 
of P and Q. Since direct product of abelian groups are abelian, G is abelian. Fur- 
ther, P © Zs2 or P © Zs x Zs. Similarly, Q © Zp or OQ © Z, x Z7. Thus, G is 
isomorphic to one and only one of the following groups: (i) Z1225 © Zs2 x Z72, (ii) 
Zs X Z5 X Zp, (ili) Z5 xX Z5 X Z7 X Z7, OF (iV) Z52 X Z7 X Z7. 

9.2.3. Let G be a group of order 7°57. We describe the center of G in case it is 
nonabelian: By the Sylow theorem, there is a unique Sylow 7-subgroup, and also a 
unique Sylow 5-subgroup. Let H denote the Sylow 7-subgroup, and K the Sylow 
5-subgroup. Then, G is the direct product H x K. Since K (being of order 5”) is 
abelian, and G is assumed to be nonabelian, H is nonabelian. The center Z(H) 
of H cannot be of order 7”, for otherwise H/Z(H) will be cyclic, a contradiction 
(Theorem 5.2.32). Hence, the center of H is a cyclic group of order 7. By Corollary 
9.1.21, and Theorem 9.1.22, Z(K) = K is either Z5 or Zs; x Zs. Thus, there are 
exactly two possibilities for the center Z(G) of G, viz. Z7 x Zos and Z7 x Zs x Zs. 

9.2.4. Let G be a group of order p” - t, where pisaprime,n > 2andt < 1+ 2p. 
Then, G has a normal subgroup of order p”, or if not, tf = 1+ p and ithas anormal 
subgroup of order p”~!. 

Proof. If t = p, then by Proposition 9.1.23, G has a normal subgroup of order 
p". Suppose t 4 p. The number m of Sylow p-subgroups of G is of the form 1 + kp 
which divides p” - t. Since it has no common factor with p”, it divides t. Thus, 
it is 1, unless t = 1+ p. Hence, the Sylow p-subgroup is normal, unless t = 
1+ p. Suppose thatt = p+ l,ie,|G|= p”-(p+1), and Sylow p-subgroup 
is not normal. In this case, the number of Sylow p-subgroups is 1+ p. Let X = 
{P,, Po,..., Pizp} be the set of all Sylow p-subgroups of G. Let H = P, () Po 
and | H |= p’.Now,| P,P, |= BRE = pr < p". (p+ 1). Hence, p"" < 
p +1. This implies that n —r < 1, and sor >n-— 1. Since P; and Py are distict, 
| H | = p"—'!. Consider the action of H on X through the inner conjugation. The 
class formula for this action is 


1+p =|X|=|X"| + ZpjealH : Hp\l, 


where A is as usual. Since H = P;() Po, hx P, = hP,h-! = P, forallh € H, 
and alsoh * Py) = hPj)h~'! = P) for all h € H. Thus, P; and P, belong to X”, 
and so | X” |> 2. Since the second term of the class formula is a multiple of p, it 
follows that X” = X.Thus,AP;h~' = P, foralli and forall h € H.This shows 
that HP; = P;H for alli. Since each P; is a Sylow p-subgroup, H = P, () P2 © 
Pi () Po(\-+: 1) Pizp © Pi () Po. Therefore, H, being the intersection of all Sylow 
p-subgroups, is a normal subgroup. 

9.2.5. Let G be a group of order p- g -r, where p, q,r are distinct primes with 
p <q <r. Then, sylow r-subgroup of G is normal. 

Proof. Suppose that neither Sylow qg-subgroup nor Sylow r-subgroup is normal in 
G.Letm,; = 1+ kyr be the number of Sylow r-subgroups, andm2 = 1 + koq the 
number of Sylow q-subgroups of G. Since these Sylow subgroups are not normal, 
m,; > 1,mz > 1, and they also divide | G |= pgqr.Clearly,m, 4 pandm, # q. 
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Since p < gq < r,m, = 1+kir = p-q. Thus, there are p - q distinct groups 
of order r. Similarly, mz = 1+ koq is at least r, and so there are at least r distinct 
subgroups of order g. Since every nonidentity element of a cyclic group of prime 
order is a generator of the cyclic group, we see that distinct Sylow r-subgroups have 
only identity element in common. Thus, there are p - q - (r — 1) elements in G of 
order r. Similarly, there are at least r - (gq — 1) distinct elements of order q. Further, 
there should be at least p — 1 elements of order p. Therefore, G should contain at 
leattp-q-(Vr-l+r-(¢g-D+p=p-q:r4+(q-1)-( -— p) elements. 
Since p < randgq > 1, the second term is positive. But, this would mean that G 
contains more than p - q - r elements. This is a contradiction. It follows that Sylow r- 
subgroup is normal, or Sylow q-subgroup is normal. If Sylow r-subgroup is normal, 
we are done. Suppose that the Sylow g-subgroup Q is normal. Let R be a Sylow 
r-subgroup of G. Then, QR is also a subgroup of G of order g - r, and the index of 
QR is p. Since p is the smallest prime dividing the order of G, QR is normal in G. 
Further, | QR |= q-r,andr > q. It follows that R is normal in QR, and there is 
only one Sylow r-subgroup in QR. Let g € G. Then, gRg-! C gQRg"! = OR, 
for (QR <1G). Thus, R and gRg™! are both Sylow r-subgroups of QR. Hence, 
gRg | = R. Since g is arbitrary element of G, R <G. tt 

9.2.6. Let G be a finite group, and p a prime which divides the order of G. Let P 
be a Sylow p-subgroup of G, and H a subgroup of G such that Ng(P) C H. Then, 
NG(H) = H.In particular, Ng(NG(P)) = NG(P). 

Proof. Clearly, H C Ng(H).Letg € NG(H).Then,gHg~'! = H.Since Ng(P) © 
H, P CH. Hence, gPg-! C gHg"! = H. Thus, P and gPg"! are both Sylow 
p-subgroups of H. By the Sylow theorem, they are conjugate in H. Hence, there 
exists an element h € H such thathPh~! = gPg™!,i.,h~'gP = Ph7'g. This 
shows that h~'g € NG(P) C H. Sinceh € H, g € H. t 


Structure of Groups of Order pq 


Let G bea group of order p - g, where p and q are primes. If p = q,then| G |= p’. 
By Theorem 8.1.22, G is isomorphic to one of the following: 


(i) Zp. 
(ii) Zp X Zp. 


Assume that p 4 qg. Without any loss, we may assume that p > q. Then, by the 
Sylow 2nd theorem, there is unique Sylow p-subgroup P (say) which is normal in 
G. It is of order p, and so cyclic. Suppose that P =< a >, where a is an element 
of order p. Now there are two cases: 


(i) g does not divide p — 1. 
(ii) q divides p — 1. 


Consider the case (i). In this case, Sylow g-subgroup is also normal (Sylow 2nd 
theorem). Hence, by Corollary 9.2.27, G is necessarily cyclic group of order pq, 
which, therefore, is isomorphic to Zpg = Zp x Zy. 
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Now, consider the case (ii). In this case p is of the form | + kg. The number m of 
Sylow g-subgroups is also of the form | + /q (by Sylow 2nd theorem), and it divides 
p-q. Thus, there are two possibilities: 


(a) m = 1. 
(b) m = 1+kq = p. 


In case (a), there is unique Sylow g-subgroup Q, which is normal and also cyclic of 
order q. It follows that G is direct product of a cyclic group P and a cyclic group Q 
which are of co-prime orders. Therefore, in this case G is isomorphic to Zp,. 

Next, suppose that G has p = 1+ kq Sylow q-subgroups. Let Q =< b> be 
a Sylow q-subgroup. Then, | b | = qg. Since P()Q = {e}, | PQ|= p-q, and 
soG = PQ. In other words 


G = {a'b/,O<i < p,0<j < qh. (9.2.2) 


To determine the group G, we need to determine a rule by which we can multiply 
a‘b/ and a‘b!. In other words given i, j,k and/, to determine u,v, O<u < pand 
0<v < qsuch that (a'b/) - (a*b') = ab’. Since P dG, bab“! € P =<a>. 
Suppose that 

bab"! = a’, O<r < p. 


Obviously r £0. If r = 1, then ba = ab, and then G will become abelian, a 
contradiction to the supposition that Sylow q-subgroup is not normal. Hence, 


bab"! = a’, l<r< p. (9.2.3) 

Since | b| = g, b? = e,andsoa = blab~4 = a”™.Thus,a”™—! = e. Since 
|a|= p, p divides rf — 1. This means that 

rf = 1 (mod p), (9.2.4) 


and of course, r € | (mod p). 

Clearly, (r,p) = l,andsor<e U,, rF 1. Since U p 18 acyclic group of order 
p — 1 (Corollary 7.6.23), andg divides p—1, H = {ae U,|a? = 1} is acyclic 
subgroup of U, of order g. Thus, there are g — 1 solutions of the equation 


x1 =1(mod p), 1 < x < p (9.2.5) 


Now, given any solution r of this equation, we can determine the product in G as 
follows: 


abi -akb! = albia‘b-ibi*" = ai(biab-/)kbit! = ai(a”)kbi*! = 
gitikpiti = ab’, 
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where u is the remainder obtained when i + r/k is divided by p, and v is th remainder 
obtained when j +/ is divided by g. Thus, G is determined once we know r with 
the required property. 

Now, it appears that there may be g — | such groups corresponding to each solu- 
tion of Eq. (9.2.5). But they are all isomorphic. The proof of this fact is as fol- 
lows: Let r and s be two numbers such that r? = 1 (mod p), 1 < r < p and 
sf =1 (mod p), 1 < s < p.Then,r and s are nontrivial elements of H which 
is a cyclic group of prime order q. Since any nontrivial element of a cyclic group 
of prime order is a generator of the group, 7 and s both generate H. Thus, we have 
m,n, 1<m < qand|l<n < q such that 


r” = s(mod p). (9.2.6) 


and 
s” =r(mod p). (9.2.7) 


Let G; = {aib/ |0<i<p—1,0<j<q-l,a@ = e = bland bab“! = 


a’}.andG2z = {aici |O<i<p- 1lO<j<q-l@=e= clandcac’! = 


a*}. Define a map ¢ from G, to Gz by 

d(a'b/) = aic™, 
and a map ~ from G2 to Gy by 

waicl) = aib™. 


Since r” = s(mod p) and s" =r(mod p),r”™” 


= qinU,, q divides mn — 1. Thus, 


= (r™)" =r(mod p). Since | F | 


mn = \(mod q). (9.2.8) 


Now wod(aib/) = waic4) = aib™ = a'b/, for by Eq.(9.2.8), mnj = 
j(mod q). Thus, woe = Ig,. Similarly, oy = Ig,. This shows that ¢ is bijective. 
Further, 


o(a'b/) = p(aitr*pi+y = gittlk ent | (9.2.9) 


and 
p(aib!)- d(akb') = aic™ -akc™ = ait kent, (9.2.10) 


Since s” =r(mod p), s") =r/(mod p), or s“k =rik(mod p). Since | a| = 
p, aits"k — aqi+’k. Thus, from (9.2.9) and (9.2.10), it follows that @ is an iso- 
morphism. 
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To summarize, we have proved the following theorem. 


Theorem 9.2.32 Let p and q be primes, and G a group of order p - q. 

(a) If p = q, then G is isomorphic to one and only one of the following two 
groups: 

(i) Zp. 

(ii) Zp X Zp. 

(b) If p Aq, p > q and q does not divide p — 1, then G is isomorphic to Z pq. 

(c) If q divides p — 1, then G is isomorphic to one and only one of the following 
two groups: 

(i) Zp. _ 

(ii)G = {a'b! |0<i < p—1,0< j <q — 1}, where the multiplication in G 
is determined by the following rule: Take any nontrivial solution r of the equation 
x? =1(mod p),2 <x < p—1, and then (a'b/)(a*b') = ab’, where u is the 
remainder obtained when i + r/k is divided by p, and v is the remainder obtained 
when j +1 is divided by q. 


Example 9.2.33 Any group of order 15 is cyclic, for 15 = 5 - 3 and 3 does not divide 
5—1=4. 


Example 9.2.34 Let G be a nonabelian group of order 2p, where p is an odd 
prime. Then, r = p—1 is the only solution x7 = 1(mod p), 2<x < p—1l. 
Thus, G = {a'b/ |0 <i < p—1, 0< j < 1}, and the multiplication is given by 
(a'b/) - (a‘b') = a“b’, where u is the remainder obtained when i + (p — 1)/k 
is divided by p, and v is the remainder when j +/ is divided by 2 (observe 
that bab-! = a?! = a7'). Since there is only one nonabelian group of order 
2p, p = 3, and the dihedral group D, of the group of isometries (see Example 
4.1.28) of a regular polygon of p sides is nonabelian of order 2p, G is isomorphic 
to D,. Indeed, the isomorphism is given by the map a‘b/ ~+ pio’, where p is the 
rotation about the center of the polygon through an angle 2, and a is a reflexion 
about a line joining a vertex of the polygon with the middle point of the opposite 
edge. 


Example 9.2.35 Let G be a nonabelian group of order 21. We find that 2 and 4 are 
the solutions of the equation x? = 1(mod7),1 < x < 7.Takingr = 2 the product 
(ab*)(a°b), for example, is a2+? b> = a®b® = a?. 


Exercises 
9.2.1 Find all Sylow subgroups of A4, S4, As, and Ss. 


9.2.2 Find Sylow p-subgroups of the group GL(2, Z,) of invertible 2 x 2 matrices 
with entries in the field Z,. 


9.2.3, Show that every group of order 35 is cyclic. 


9.2.4 Show that every group of order 4199 is cyclic. 
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9.2.5 Let G be a group of order pq, where p and q are prime. Show that G cannot 
be simple. 


9.2.6 Let G be a group of order pq, where p and q are primes such thatg < p < 
2q + 1. Show that G is cyclic. 


9.2.7 Show that every noncyclic group of order 6 is isomorphic to $3. 
9.2.8 Give a presentation of a nonabelian group of order 33. 


9.2.9 Show that every group of order 48,841 is abelian. Find the number of noniso- 
morphic groups of this order. 


9.2.10 Show that every group of order 99 is abelian. 
9.2.11 Show that a group of order 12 cannot be simple. 
9.2.12 Show that a group of order 17 - 23 - 29 cannot be simple. Is it cyclic? 


9.2.13 Let G be a group of order p*-(p+1)-m such that (p+ 1)-m < p’, 
where p is prime. Show that G has a normal subgroup of order p* or a normal 
subgroup of order p?. 


9.2.14 Show that every group of order 216 contains a normal subgroup of order 27 
or a normal subgroup of order 9. 


9.2.15 Show that a group whose order is less than 60 is simple if and only if it is 
cyclic group of prime order. 


9.2.16 Show that a group of order 28 having a normal subgroup of order 4 is abelian. 


9.2.17 How many nonisomorphic groups of order 55 are there? How many elements 
of order 5in a nonabelian group of order 55 are there? 


9.2.18 Show that a group of order p* - (p + 1) has a normal subgroup of order p* 
or a normal subgroup of order p>. 


9.2.19 Let H bea proper subgroup of a group G such that| G |= p”,n > 2. Show 
that there exists x € G— H such thatxHx~' = H. 


9.2.20 Find the number of Sylow p-subgroups of S,. 
9.2.21 Show that a subgroup of order p"~! of a group of order p” is normal. 


9.2.22 Let G bea group of order p” - g, where p and q are primes. Let P be a Sylow 
p-subgroup of G. Show that either P is normal or Ng(P) = P. 


9.2.23 Let P be a normal Sylow p-subgroup of a finite group G and f an endo- 
morphism of G. Show that f(P) C P. 
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9.2.24 Let H be anormal subgroup of G such that p does not divide | G/H |. Show 
that H contains all Sylow p-subgroups of G. 


9.2.25 Let H be anormal subgroup of a finite group G, and P a Sylow p-subgroup 
of G which is contained in H as a normal subgroup. Show that P JG. 


9.2.26 Let G be a group of order p” - t, where p is a prime and n > 2. Assume that 
t < 2p. Show that G cannot be simple. 


9.2.27 Let G be a group and a ¥ e. Suppose that a is of finite order, and also the 
conjugacy class determined by a is finite. Show that G has a nontrivial finite normal 
subgroup. 

9.2.28 Find a nontrivial solution r of x? = 1(mod 13), and then find (a>b”) - (a’b), 


where a!? = e = bo andbab"! = a’. 


9.2.29 Show that if a is conjugate to b, then Cg(a) is conjugate to Cg (b). 


9.2.29* Show that every simple group of order 60 is isomorphic to As. 
Hint. Show the existence of a subgroup H of order 12, and then consider the action 
of GonG/'H. 


9.2.30 Let G be a group of order p?q. Show that G has a normal Sylow subgroup. 


9,.2.31* Let G be group of order p*q. Suppose that G has no normal Sylow sub- 
groups. Show that G is isomorphic to S4. Deduce that no group of order p*q can be 
simple. 


9.2.32* Let G be a group of order p’"q, where p and q are distinct primes. Suppose 
that Sylow p-subgroup of G is not normal in G. Show the following: 


(i) There are g Sylow p-subgroups. 

(ii) Let K be amaximal member among the intersections of distinct pairs of Sylow 
p-subgroups, and H = Ng(K). Show that H has at least two distinct Sylow 
p-subgroups. 

(iii) Show further that H contains exactly g Sylow p-subgroups. 

(iv) Deduce that all Sylow p-subgroups of H are contained in K. 

(v) Show that every Sylow p-subgroup of a finite subgroup is obtained by taking 
the intersection of a Sylow p-subgroup of the group with that subgroup. 

(vi) Use (v) to deduce that K = {e}. 

(vii) Deduce that the intersection of any two distinct Sylow p-subgroup is trivial. 
(viii) Counting the elements of G, show that there is unique Sylow g-subgroup of 
G. 
(ix) Deduce that G cannot be simple. 


Remark 9.2.36 More generally, no group of order pq” can be simple, where p and 


q are primes. This was proved by Burnside in the beginning of the 20th century using 
the representation theory. The proof will be given in the chapter on representation 
theory, Algebra 2. A nonrepresentation theoretic proof of this fact was obtained quite 
late with the works of Thompson, Goldschmidt, Bendor and Matsuyama. 
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9.2.33 Describe all nonabelian groups of order 8. 


9.2.34 A finite group G is called a CLT group, if corresponding to every divisor m 
of the order of the group G, there is a subgroup of order m. Show the following: 


(i) Every finite p-group is a CLT group. 

(ii) Suppose that every Sylow subgroup of G is normal. Show that it is a CLT 
group. 

(iii) Every finite abelian group is a CLT group. 

(iv) Give an example to show that subgroup of a CLT group need not be a CLT 
group. 

(v) Give an example to show that quotient of a CLT group need not be a CLT group. 

(vi) Show that $4, a CLT group. 


9.2.35* Describe all nonabelian groups of order p*, where p is a prime. 
9.2.36* Describe all groups of order pqr, where p, q, r are primes. 


9.2.37* Let G be a finite p-group, where p is an odd prime. Show that all subgroups 
of G are normal if and only if G is abelian. Deduce that all subgroups of an odd 
ordered group G is normal if and only if G is abelian. Give an example to show that 
the result is not true for p = 2. 

Hint. Observe that every nontrivial normal subgroup of a p-group contains a non- 
trivial element of the center. Use Induction. 


9.3 Finite Abelian Groups 


In this section, we classify all finite abelian groups. Since all Sylow subgroups will 
be normal, a finite abelian group is direct product of its Sylow subgroups. As a 
consequence, it follows that two finite abelian groups are isomorphic if and only if 
there corresponding Sylow subgroups are isomorphic. Thus, it is sufficient to classify 
all finite abelian groups of prime power orders. 


Theorem 9.3.1 Let G be a finite abelian p-group of order p". Then, there exist pos- 
itive integersn, >n2 >--->n;, ny +n2+---+n, = n together with elements 
X1,X2,...,X, in G such that | x; |= p" for alli, and G is the direct product 


<x) >X <> XX <u>. 


Proof The proof is by induction on n. If n = 1, then G itself is cyclic generated 
by any nonidentity element of G. Assume that the result is true for all those abelian 
groups of orders p”, where m < _ n. Let G be an abelian group of order p”. We have 
to prove the result for G. Let x; be an element of maximum order p”' inG.Ifn,; = n, 
then G is cyclic and there is nothing to prove. Suppose that n; < n. Consider the 
group H = G/ < x; >. Clearly,| H | = 5 = p™™, n—n, < n. By the 
induction hypothesis, 
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H =< v(y2) > Xx <v(y3) > X--- xX < Vy) >, 


where v is the quotient map from G to H = G/ < X, >, y2, ¥3,.--,Yy €G, 
lv(y) |= p”,andnz >n3>--- >n,. Since p”! is maximum among the orders 
of elements of G, and | v(y) | divides | y | for all y € G, it follows that n; > nz > 
ara = Ny. 

Next, we show the existence of z; € G such that v(z;) = v(y;), and | z; | 
= | (zi) | = | vy) | = p” for all i = 2. Since | u(y) | = | Git < 41 >) | = 
Dp", py t <x, > =< x, > forall i > 2. Hence, p”y; €< x; > for alli > 2. 
Suppose that p"y; = p"ajx,, i > 2, where (p,a;) = landt; <n,.Ift; = m, 


then p”'y; = 0 and | y; | divides p”. Since p”™ = | v(y;) | divides | y; |, it fol- 
lows that | y; | = | v(y;) |, and we can take z; = y;. Suppose that t; < n,. Since 
(a;, p"') = 1, by the Euclidean algorithm, there are integers wu and v such that 
ua; + up” = 1. But, then x} = waj;x,, and so | a;x; | = |x; | = p”. Thus, 
| py: | = | p’ajx, | = p™—". Hence, | ys | = p™ "*". Since p” is largest 


among the orders of elements of G,n, — t; +1; < n,. This shows that n; < t;. Take 
Zi = Yi — p" “a;x,. Then, v(z;) = v(y;), and| z |= p™ =| v(yi) |. 
Finally, we shall show that 


G=<x,>xX<2>X-+:+X<Z>. 


Letx € G.Then, v(x) € H.Since H= < v(y2) > x < V(y3) > X-+- xk < Vy) > 
=< V(Z2) > X < V(Z3) > X-++X < YZ) >, it follows that 


V(x) = aQu(z2) + agy(z3) + +++ aU (Zr). 
for some integers a2, a3,..., @,. This means that 


X+ <x, >= (Q2Z%2 + 0373 + +++ + OZ) + <x, >. 


Hence, x Q2Z2 0323 ee a,Zz, belongs to < x; >. In other words, 
xX = ax; + Q2zZ72 + ---+ a@,z, for some integers a), Q2,..., Gy. 

Next, suppose that ayx; + Qoz2 + +--+ a,z, = 0. Then, agv(z2) + 
-++Q;V(Z-) =< x; > the zero of H. Since H =< v(z2) > X-+++X < V(Z) >, 
ajV(Z;) =< x; > foralli > 2. Since | z; | =| v(z; >|, it follows that a;z; = 0 for 
alli > 2. In turn, a,x, is also 0. This shows thatG =< x, > xX <Z72>X°:+::X < 
Zp >. t 


Corollary 9.3.2 If G is a finite abelian group, then G is a direct product of cyclic 
groups of prime power orders. 


Proof Every finite abelian group is direct product of its Sylow subgroups which are 
of prime power orders. The result follows from the above theorem. tt 


Theorem 9.3.3 Let G and G’ be finite abelian p-groups. Suppose that 


9.3 Finite Abelian Groups 337 
G=<x,>X <x%2> XX <xX > 


and 
Go =<y>X <y>X-X <Yy>. 


are direct decompositions of G and G', respectively, as direct product of cyclic groups 


with |x; |= p™ ly l= p', nm 22 2-+-2N,, and m, > m2 >---> ms. 
Then, G is isomorphic to G' if and only if 

(i) r=s. 

(ii) nj = m; foralli. 


Proof If part of the proof is evident because any two cyclic groups of same orders are 
isomorphic, andif H ~ H’ and K ~ K’,then H x K isisomorphic to H' x K'. The 
proof of the converse is by induction on max(r, 5). If max(r, s) = 1, then G and G’ 
are cyclic groups of same prime power orders and so they are isomorphic. Assume 
that the result is true for all cases for which max(r,s) < m, m > 1. Suppose that 
max(r,s) = m inthe representation of G and G’ and o is an isomorphism from G 
to G’. Clearly, p”! is the maximum among the orders of G and p”! is the maximum 
among the orders of G’. Since under an isomorphism orders of elements remain the 
same, p”! = p”!,andson,; = my. Let 


(x1) = Biyr + Pay2 + +++ Bsys - (9.3.1) 
|x; |=| o(x) |= p™ andn, = m, => m; forall j. Thus, | 8;y; |= p”! for some 
j. After rearranging, we can assume that | G;y; |= p”! (note thatif| Gy; |= p™, 


then, since | 3; yj; | divides | y; |, | yj | = p”'). We show that 


G’ =<o(x1)> x <p> XX <Ys>. 


Since | yy: |= p™ =| Giyi |, (G1, p”!) = 1. By the Euclidean algorithm, there 
exist u,v € Z such that u3, + vp™ = 1. Hence, yy = ufiy, = u(o(x1) — 
Boy a Pe BsYs)- Since {y1, Y2; cry ys} generates G’, {o(%1), Y2; Cy Ys} also 


generates G’. Further, suppose that 
d1o(xX1) + d2y2 + +++ + dys = 0. 
Substituting the value of o(x,) from 1, we get 
opin a (61 2 + 62)y2 a a (018s + b5)Ys = 0. 
Since Gl =< yp > xX <y>xX-X <ys > diy = O1+ 2)y = ++ = 
(018; + 65)ys = 0. Since | Byy, |= p”™, p™ divides 6,. Hence, d;y; = 0 for all 
i > 2. This shows that 6;y; = 0 for alli > 2, and so dj0(x,) = O. Thus, 


G =<o(%1)>X <> XX <Yy,>. 
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Since o is an isomorphism from G to G’ such that o(< x; >) =< a(x;) >, it 
induces an isomorphism from G/ < x; > to G'/ < a(x) >. Also 


G/ <x) >¥< x2 > X+++X <x, > 


and 
G'/ < 0(x1) >&< yo > X00 X <Ys >. 


By the induction assumptionr — 1 = s—1landn,; = m; foralli > 1. tt 


Corollary 9.3.4 The number of isomorphism classes of abelian groups of order p" 
is p(n), where p(n) is the number of partitions of n. 


Proof From the above theorem, it is clear that there are as many isomorphism classes 


of abelian groups of order p” as many elements in the set {(71, 2,...,”,) | m1 > 
ng >++->n, and ny +no+---+n, = n}. The number of elements in this set, 
by definition, is p(n) the partition of n. ft 


Since any two finite abelian groups are isomorphic if and only if their Sylow 
subgroups are isomorphic, we have the following corollary: 


ny ng 


Corollary 9.3.5 Letn = p\' p>’... py, where pi, po, ..., pr are distinct primes. 
Then, the number of isomorphism classes of abelian groups of order n is p(n) - 
p(nz)... p(n,), where p is the partition function. tt 


Exercises 


9.3.1 Find the number of nonisomorphic abelian groups of order 144. Also list a 
member from each isomorphism class. 


9.3.2 Show that there are as many isomorphism classes of abelian groups of order 
Pi p>’... pe" as many conjugacy classes in S,, x +++ X Sh,. 

9.3.3 Let G and G’ be finite abelian groups such that for each n, G and G’ have 
same number of elements of order n. Show that G is isomorphic to G’. 


9.4 Normal Series and Composition Series 


Let G be a group with operator set Q. A Q-subnormal series of G is a descending 
chain 
G=G,2G,2---bPG,b Gry = {e} (9.4.1) 


of Q-subgroups of G, where G;,; < G; for all i. (Note that a term in a Q-subnormal 
series need not be Q-normal in G. Indeed, they are called Q-subnormal subgroups). 
If 2 = G, then a Q-subnormal series is simply said to be a subnormal series of G. 
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If Q = Inn(G), then all the terms of Q-subnormal series are normal subgroups of 
G, and in this case, we say that it is a normal series of G. If & = Aut(G), then 
all the terms of Q-subnormal series are characteristic subgroups, and in this case, 
we term it as a characteristic series of G. If Q = End(G), then all the terms of 
(2-subnormal series are fully invariant subgroups, and in this case, we term it as a 
fully invariant series of G. 

We say that the series | is without repetitions, if G; 4 G;+, for alli. The number 
n is called the length of the series 1. 

The {2-quotient groups G;/G2, G2/G3,..., Gn/Gn+, are called the factors of 
the series 1. 

Let 

G = MPD DH, > Any = {e}. (9.4.2) 


be also a {2-subnormal series of G. We say that (9.4.1) is refinement of (9.4.2) if there 
exists an injective map o from {1,2,...,m} to {1,2,...,n} such that H}) = Goi) 
for all i. Thus, if (9.4.1) is refinement of (9.4.2), then m <n. We say that (9.4.1) is 
a proper refinement of (9.4.2) if (i) (9.4.1) is refinement of (9.4.2), and (ii) m <n. 
A Q-subnormal series of a Q group G which is without repetitions is called a 
Q2-composition series of G, if it has no proper refinement which is a Q-subnormal 
series, and which is without repetitions. If 2 = @, then a Q-composition series is 
simply said to be a composition series of G. If 2 = Inn(G), then Q-composition 
series are called a principal or chief series of G. If Q = Aut(G), Q-composition 
series are called a principal characteristic series of G. If Q = End(G), then 
Q2-composition series are called a principal fully invariant series of G. 
Q2-subnormal series (9.4.1) and (9.4.2) are said to be equivalent, if 


(i) m = n, and 

(ii) there is a bijective map between the set of factors of (9.4.1) and that of (9.4.2) 
such that the corresponding factors are 2-isomorphic. We shall also express this 
by saying that the factors of (9.4.1) are Q2-isomorphic to the factors of (9.4.2) 
after some rearrangement. 


Example 9.4.1 Zt 6Z & 24Z > {0} is a normal series of the additive group Z of 
integers which is without repetitions. The length of this normal series is 3, the factors 
are Z,/6Z © Zo, 6Z/24Z * Za, and 24Z/{0} © Z. 


Example 9.4.2 The series S4 > A4 > V4 > {I} is a normal series of S4 which is a 
proper refinement of the normal series $4 > V4 > {7}, and both are without repeti- 
tions. 


Example 9.4.3 The series S4 > Ag > V4 > {7, (12)(34)} & {7} is asubnormal series 
of S4. Indeed, this is a composition series of S4, for it is without repetitions, and it 
cannot be refined further without admitting repetitions (note that there is no subgroup 
of S4 in between S4 and Ay except S4 and A,z, and similarly, there is no subgroup 
between A, and V4 except A4 and V4, etc.). The series S$4> Ag > V4 > {I} is a 
principal series of $4. Indeed, it is a principal Characteristic series of G. Is it a fully 
invariant series? 
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Example 9.4.4 The normal series Z > 8Z > 24Z > 48Z > {0} of Z is equivalent 
to the normal series Z > 3Z > 6Z > 48Z > {0} (verify). The subnormal series S4 > 
Ag > V4 > {7, 12)(34)} & {7} of S4 is equivalent to the subnormal series Sy > Ay > 
Va & {7, (13)(24)} & {7} (verify and write another subnormal series of $4 which is 
equivalent to the above two subnormal series). The normal series Cg > C3 > {e} and 
the normal series C6 > Cz > {e} of a cyclic group Ce of order 6 (C3 is the cyclic 
subgroup of order 3 of C6, and C2 is that of order 2) are equivalent. Note that they 
are also principal fully invariant series. 


Two subnormal series of a group may not have acommon refinement. For example, 
Sy > V4 > {7, 12)(34)} & {7} and Sy > V4 > {7, (13)(24)} & {7} have no common 
refinements. However, we have the following theorem of Schreier. 


Theorem 9.4.5 (Schreier) Let G be a group with operator set Q. Then, any two 
Q-subnormal series (8&-normal series) of G have equivalent refinements. 


We need the following theorem known as Zassenhauss Lemma or third isomor- 
phism theorem for the proof of the Schreier’s theorem. 


Theorem 9.4.6 (Zassenhauss Lemma) Let G be a group with operator set Q. Let 
G1, G2 be Q-subgroups of G. Suppose that G‘, is Q-normal subgroup of G\, and 
G5, a Q-normal subgroup of G2. Then, (G\ (| G5)G', is Q-normal subgroup of 
(G1 (| G2)G), and (G‘ (| G2)G is Q-normal subgroup of (G1 (| G2)G4. Fur- 
ther, the corresponding factors (G\ (\ G2)G‘,/(Gi (| G5)G), and (G1 (| G2)G4/ 
(G4 (| G2)G4 are Q-isomorphic. 


Proof Since G4 is Q-normal subgroup of G2, (Gi (| G4) is Q-normal in Gj). 
Again, since G‘, is Q-normal in G1, the product (G; (| G5)G‘, is a Q-subgroup, 
in fact, a Q-normal subgroup (product of &-normal subgroups are Q-normal sub- 
groups) of G,. Similarly, it can be observed that all subgroups in the lemma are 
Q-subgroups. Let h = uv € (Gi (| G5)G), where u € G, (|G), v € Gi, and 
g = xy € (Gi () G2)G), where x € G; (| Go and y € G).. Then, 


ghg"' = xyuv(xy)! = xux !(ux—!) y(ux x(vy ee 


1 


Observe that ux~' € Gi, and y € G}. Since G) I Gi, (ux7')~!y(ux7!) € G4. 
Also, since vy~! € Gi, and x € Gy, xvy-!x7! € G}. Thus, ghg-! = xux-'w, 
where w € G‘.. Further, since x, u € G1, xux—! € Gy. Again, u € G4, x € Go, and 
since G <1 G2, xux~! € G4. Hence, xux~! € G; () G5. This shows that ghg™! € 
(Gi (| G5)G). Thus, (G; () G5)G <I (Gi (| G2)G‘. Similarly, interchanging the 
role of indexes 1 and 2, we can show that (G{ (| G2)G4 <I (Gi () G2) G4. 

Now, we show that (G; (| G2)/(Gi (\ G5)(G) (| G2) is Q-isomorphic to 
(G1 (| G2)G)/(Gi (1) G5)G). Take H = (G,()G2), and K = (G,(\G4)G\. 
Then, H is a Q-subgroup of (Gi (\G2)G), and K a Q-normal subgroup of 
(G1 {| G2)G. By the Noether second isomorphism theorem, H/(H () K) is Q- 
isomorphic to H K/K (note that all isomorphism theorems hold for group with oper- 
ators). Now, HK = (G,() G2)(Gi (| G4)G, = (Gi (| G2)G', for (G1 1] G4) S 
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(Gi {| G2). Thus, HK/K = (Gi ()G2)G)/(Gi (|) G5)G). Further, H() K = 
(Gi NGG) MGNG) = (GiMG)G))MG2. Clearly, (G1 G) 
(GM G2) S (Gi 1 GG) 1) Gr. Let g € (Gi G4)G}) (} Go. Then, g € G 
and g = xy, where x € (G, (| G4) and y € G}. Since g,x € G2, y € Go. Thus, 
g = xy € (Gi () G5)(G) [) G2). This shows that H (| K = (Gi (| G5)(G, (1) G2), 
and hence, (G1 () G2)/(Gi (1) G5)(G{ 1) G2) is Q-isomorphic to (G1 1) G2)G{/ 
(Gi ()G4)G\. Interchanging the role of the indexes, we see that (Gj () G2)/ 
(G; (| G5)(G{ (]) G2) is Q-isomorphic to (G1 1] G2)G5/(G), (| G2)G4. The result 
follows from the fact that the relation of Q-isomorphism is symmetric as well as 
transitive. tt 


Proof of the Schreier’s theorem. Let 


CS Gib Gb G. 56.4 = ia (9.4.3) 


and 
G=HAHCHML:-. CA,& Ans = {e} (9.4.4) 


be two &2-subnormal series of G. For 1 <i <n+land1< j <m-+1, put 
Gij = (Gi) Gis, Grae = {e}, Hy = (Gi (| Hj) Aji. Hna2 = {e}. 


By the Zassenhauss lemma, G;j;+; is (2-normal in G;;, Hj.) is 2-normal in Hj;, 
and 
Gij/Gij+i © Ayi/Aji+1 (9.4.5) 


for all i, j. Now, Gi = (G; (|) M)Gien = (G{1 OGin1 = GiGi = Gi, 
and Gingi = (Gif) Hn+t)Gi+1 = {e}Gi+1 = Gi41. Thus, for each i, we have 
the segment 

G; = GulbGobt---& Gina = Gis, (9.4.6) 


where each term is &2-normal in the preceding term. Similarly, for each j, we have 
the segment 
AH, = Ay Ar---© Aimy = Aji, (9.4.7) 


where each term is (2-normal in the preceding term. Insert the segment (9.4.6) 
between G; and G;,, for all i in the Q-subnormal series (9.4.3), and the segment 
(9.4.7) between H; and Hj, for all j in the Q-subnormal series (9.4.4). We get 
refinements of (9.4.3) and (9.4.4), respectively, which are equivalent by (9.4.5). #f 


Remark 9.4.7 The proof of the Schreier’s theorem gives an algorithm to determine 
equivalent refinements of any two subnormal series provided we have an algorithm 
to determine HK and H (| K for subgroups H and K appearing in the subnormal 
series. For example, in additive group Z of integers, mZ(\nZ = [m.n]Z, and 
mZ + nZ = (m,n)Z, where [m, n] is l.c.m. and (m, n) is the g.c.d. of m and n. 
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Example 9.4.8 Consider the normal series 
Ze 12Z > 120Z > 240Z > {0} 


and 
Z & 25Z > 100Z > {0} 


of Z. Using the algorithm of the Schreier’s theorem, we get the refinement 
Ze 4Zp 12Zb 60Zb 120Z > 240Z > 1200Z > {0} 

of the first normal series, and the refinement 
ZO 5Z > 25Z > 100Z > 300Z > 600Z > 1200Z > {0} 


of the second normal series which are equivalent. 


Example 9.4.9 We determine equivalent refinements of 
ReQeZe {0} 


and 
Re Q+27Z b> 27nZ Pb {0}. 


Here,Gi2 = (R(IV(Q4+27Z)) + Q=Q4 29724 Q0=Q 4+ 222, G3 = 
(R() 27Z) +Q0=Q+ 2nZ, Gig = Q = G2. Further, G23 = (Q()27Z) + 
Z = {0} + Z = Z. Observe that Q(|27Z = {0}, for z is irrational. Similarly, 
Gu = Z = G3 and G33 = {0}. Thus, we get a refinement 


R>Q+27Z> Qe Ze {0} 
of the first normal series, and similarly, we get a refinement 
R>Q+2rZ>Z+4+27Z > 2nZ > {0} 


of the second normal series which are equivalent. 


Proposition 9.4.10 We can remove all repetitions from any two equivalent Q- 
subnormal series of a group without affecting their equivalence. 


Proof The proof is by the induction on the length of normal series. If both the Q- 
subnormal series have length of 1, then there is nothing to do. Suppose that the result 
is true for all (2 equivalent subnormal series which have length n. Let 


Ca Oe Geet ECLEeC Eee Se (9.4.8) 
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and 
G=AMebME--- FA, © Anyi © An+2 = fe} (9.4.9) 


be (2-equivalent subnormal series. If there are no repetitions, then there is noth- 
ing to do. Suppose that G; = Gj;+;. Since (9.4.8) is equivalent to (9.4.9), there is a 
bijection o from the set {G;/G2, G2/G3,..., Gn41/Gn+2} to {AM / Ho, Ho/H3,..., 
An41/Hn+2} such that the corresponding factors are -isomorphic. Let 
0(G;/Gis+1) = Hy /Hj+1 ~ G;/Gj41. Since G; = Gj, Hi; = Aj41- Remov- 
ing G;,, from | and Hj; from 2, we get two Q-equivalent subnormal series of 
length n, and so by induction assumption, we can remove all repetitions from these 
Q-subnormal series without affecting their equivalence. tt 


&2-Composition Series 


A group need not have any composition series. For example, the additive group Z 
of integers have no composition series: given any normal series Z > miZ > mZb> 
---m,Z & {0}, which is without repetitions, we have a proper refinement Z > m,Z > 
mL>--+-&m,Z > 2m;,Z & {0} which is without repetitions. However, if a group 
has a (2-composition series, then it is essentially unique. 


Theorem 9.4.11 (Jordan Holder Theorem) Any two Q-composition series ofa group 
are equivalent. 


Proof By the Schreier’s theorem, any two (2-composition series of a group will have 
equivalent refinements. A (2-composition series, by definition, is a (2-subnormal 
series without repetitions, and which cannot be refined further without admitting 
repetitions. Thus, if we remove all repetitions from the equivalent refinements of the 
two composition series, then we arrive at the original composition series. From the 
above proposition, removing all repetitions from equivalent subnormal series does 
not affect their equivalence. It follows that any two composition series of a group are 
equivalent. ft 


Taking Q = @ (Inn(G), Aut(G), End(G)), we get the following corollary: 


Corollary 9.4.12 Any two Composition series (Principal series, Principal Charac- 
teristic series, Principal Fully invariant series) are equivalent. tt 


Let G be a group which has a composition series. Then, the length of any two 
composition series are same. This common number is called the composition length 
or Jordan Holder length of G, and it is denoted by /(G). Thus, /(S4) = 4, for 
Sa Ag > V4 b> {7, (12)(34)} & {7} is a composition series of S4. For n > 5, S, > 
A, > {I} is the only composition series of S,,. Thus, /(S,) = 2 forn > 5. Similarly, 
(Qs), = 3,l(Aa) = 3,1(An) = 1 forn >5.1(G) = | if and only if G is 
simple. An isomorphism takes a composition series to a composition series, and 
hence, isomorphic groups have same composition length. Thus, composition length 
is an invariant of the group structure(G ~ G’ implies that 1(G) = 1(G’)). But 
l(G) = 1(G’) does not imply that G ~ G’ (1(S3) = 2 = 1(S5)). 
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Let 
G= Gi > G2b-:->Gr> Gas = {e} 


be a Q2-composition series of a 2-group G. Then, it cannot be refined further without 
admitting repetitions. Thus, we cannot put anything in between G; and G;+ for any 
i. This means that Gj; is maximal (2-normal subgroup of G; for all i. Conversely, 
if each Gj, is a maximal (2-normal subgroup of G; for all 7, then we cannot refine 
the given Q2-subnormal series further without admitting repetitions. Thus, we have 
the following proposition. 


Proposition 9.4.13 A Q-subnormal series 
G = Gi>Gr>::-b>G,zb Gay = fe}, 


is a 82-composition series if and only if each G;4, is maximal Q-normal inG;. ff 


Clearly, H is maximal Q-normal in G if and only if G/# is nontrivial (2-simple 
in the sense that it has no proper Q2-normal subgroup. Thus, we have the following: 


Corollary 9.4.14 A Q-subnormal series is a Q-composition series if and only if all 
its factors are nontrivial Q-simple groups. tt 


Since factors of equivalent Q-subnormal series are {2-isomorphic, and a group 
which is Q2-isomorphic to a Q-simple group is &2-simple, from the previous corollary, 
we get the following corollary. 


Corollary 9.4.15 A Q-subnormal series, which is equivalent to a Q-composition 
series, is a §2-composition series. tt 


Proposition 9.4.16 Let G be a group with operator Q which has a Q-composition 
series. Then, any 92-subnormal series of G which is without repetitions can be refined 
to a {2-composition series. 


Proof Let 
G = G,>Go>b::-&>Grzb Gry = fe} (9.4.10) 


be a Q-composition series and 
G=HWCMP::-bAn> Anu = {e} (9.4.11) 


a Q2-subnormal series without repetitions. By the Schreier’s theorem, (9.4.10) and 
(9.4.11) have equivalent refinements. After removing the repetitions from the refine- 
ment of (9.4.10), it reduces to (9.4.10), and after removing the repetitions from the 
refinement of (9.4.11), it will remain to be a refinement of (9.4.11). Thus, there 
is a refinement of (9.4.11) which is equivalent to the composition series (9.4.10). 
Since any (2-subnormal series which is equivalent to a (2-composition series is a Q 
composition series, (9.4.11) has a refinement which is a Q2-composition series. 
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Proposition 9.4.17 Let G be a group with operator set Q which has a Q-composition 
series. Let H be a Q-normal subgroup of G. Then, H and G/H both will have Q- 
composition series, and 


W(G) = 1(H) + U(G/H) 


Proof lf H = {e} or H = G, then there is nothing to do. Suppose that H ¥ {e} 
and H #£G. Then, G> AP {e} is a Q-subnormal series of G which is without 
repetitions. From the above proposition, it can be refined to a (2-composition series 


G=G,PG2~P::->Gi_|OAD-::-> Gay = {e}. 


Then, 
HD Gis Deb Gay = {e} 


is a Q composition series of H. Further, by the first isomorphism theorem, 
(G;/H)/(Gj41/H) is Q-isomorphic to G;/Gj+4; for all j <i—1. Hence, 
(G;/H)/(Gj41/H) is Q-simple for all 7 < i — 1. Thus, 


G/H = G,/H > G2/H ©--->G;_1|/H > {HA} 


is a Q2-composition series of G/H. Clearly, /(G) = 1(H) + I(G/H). tt 


The following corollary is a consequence of the fundamental theorem of homo- 
morphism, and the above corollary. 


Corollary 9.4.18 Let G be a Q-group which has a Q-composition series. Then, any 
Q-homomorphic image of G has a Q-composition series. If f from G to G' is a 
surjective Q-homomorphism, then1(G) = I(G') + I(ker f). tt 


Proposition 9.4.19 Let H be a Q-normal subgroup of G such that H and G/H 
both have Q-composition series. Then, G has a & composition series, and I(G) = 
I(H) + I(G/H). 


Proof Let 
H = > A>: Anyi = {e} 


be a Q-composition series of H, and 
G/H = Gi/H © Go/H ©-:-& Gaii/H = {H} 


a Q-composition series of G/H. Then, H;/Hj+; is Q-simple. Further, G;/Gj+1, 
being &2-isomorphic to (G;/H)/(G;+,/H), is Q-simple. Hence, 


G = G)PG2b-++-b Gngi > Ay b--+ > Anyi = fe} 


is a Q-composition series of G(note that G,,,; = H). Clearly, /(G) = 1(H) + 
(G/F). t 
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Proposition 9.4.20 Let H, be a Q-subgroup of G, and Hz a Q-normal subgroup 
of G. Suppose that H, and Hy have (2-composition series. Then, H, Hy has a Q- 
composition series, and 1(H, Hy) = 1(H,) +1(H2) — 1H, ()\ Ap). 


Proof By the Noether isomorphism theorem, H) H2/H> ~ H,/ Hj, (\ Hz. Since Hi 
has a Q-composition series, H)/H; (| H> also has a Q-composition series. Hence, 
FH, Hy / Hy has a Q-composition series. Since H) is supposed to have a (2-composition 
series, by Proposition 9.4.19, H; Hz has a 2-composition series. Also/(H, H)/H 2) = 
1(H,/H, (| Hz). Hence, 1(H) Hy) — 1(Hy) = 1(H\) — 1A; () Ad). t 


Proposition 9.4.21 Every finite Q-group has a Q-composition series. 


Proof The proof is by induction on order of G. If | G | = 1, there is nothing to do. 
Assume that the result is true for all those groups whose orders are less than n. Let G 
be a Q-group of order n. If G is Q-simple, then G > {e} is the 2-composition series of 
G. Suppose that G is not Q-simple. Let H be a nontrivial proper normal Q-subgroup 
of G. Then, | H | < n, and also | G/H |< n. By the induction hypothesis H and 
G/H both have Q2-composition series. By Proposition 9.4.19, G has a Q-composition 
series. tt 


Remark 9.4.22 The main guiding problem in the theory of finite groups is the clas- 
sification of finite groups, or in particular, classification of finite groups of a given 
order. Every finite group has a composition series. As such, this classification prob- 
lem reduces to the following two problems: 

Problem 1 Classify all finite simple groups. 

Problem 2 Given finite groups H and K, classify the class of groups G having H 
as anormal subgroup with G/H isomorphic to K. 

The classification of finite simple groups was achieved in 1980. There are four 
types of finite simple groups to be described in another volume of the book. However, 
the solution to the problem 2 is beyond dream to mathematicians, and usually, partial 
solutions to this problem are addressed in the theory of extensions and cohomology 
of groups (see Chap. 10, Algebra 2, or any book on cohomology of groups). 


Proposition 9.4.23 An abelian group has a composition series if and only if it is 
finite. 


Proof If G is finite then, by the Proposition 9.4.21, it has a composition series. Let 
G be an abelian group, and 


G= Gi>Goe-:-> GzP Grit = {e} 


a composition series of G. Then, G;/G;+, is abelian as well as simple for all 7. Since 
an abelian group is simple if and only if it is a cyclic group of prime order, G; /Gj+ 
is finite cyclic group of order p;, where p; is some prime. Thus, G, ~ G,,/{e} is 
finite of order p,. Since G,_;/G, is finite of order p,-1, Gy_1 is also finite of 
order Pn—1 Pn (observe that if H is finite and G/H is also finite, then G is also finite 
and |G|=|H|-|G/A |). Similarly, G,_2 is also finite of order p,—2 Pp—1Pn- 
Proceeding inductively, we find that G is finite of order p; po... Pn. tt 
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Recall that a subgroup H of a group G is called a subnormal subgroup of G if it 
appears in a subnormal series of G. We write H <1 SG to say that H is a subnormal 
subgroup of G. A subnormal subgroup need not be normal. For example {7, (12) (34)} 
is a subnormal subgroup of S4 (S4 > V4 > {7, (12)(34)} & {7} is a normal series of 
S4). An arbitrary subgroup need not be a subnormal subgroup for example {/, (12)} 
is a subgroup of $3 which is not subnormal ($3 > A3 > {J} is the only nontrivial 
normal series of 53). 

The following proposition is an immediate consequence of the fact that the inverse 
image of a normal subgroup under a homomorphism is a normal subgroup, and the 
image of a normal subgroup under a surjective homomorphism is a normal subgroup. 


Proposition 9.4.24 Inverse image of a subnormal subgroup under a homomorphism 
is a subnormal subgroup, and under a surjective homomorphism, image of a sub- 
normal subgroup is a subnormal subgroup. tt 


Corollary 9.4.25 In the correspondence theorem, subnormal subgroups corre- 
spond. t 


Corollary 9.4.26 A subgroup K/H of G/H is subnormal if and only if K is sub- 
normal in G. t 


Proposition 9.4.27 A group G is simple if and only if it has no nontrivial proper 
subnormal subgroups. 


Proof \fa group G has no nontrivial proper subnormal subgroups, then, in particular, 
it has no nontrivial proper normal subgroups, and so, it is simple. Conversely, if G 
is simple, then G > {e} is the only normal series of G, and so, it has no nontrivial 
proper subnormal subgroups of G. tt 


Remark 9.4.28 It may be an interesting problem to study groups all of whose sub- 
groups are subnormal. Finite groups all of whose subgroups are subnormal are known 
as nilpotent groups. We shall discuss such groups in Chap. 10. 


Proposition 9.4.29 Let G be a group which has a composition series, and H <1 1G. 
Then, H has a composition series, and|(H) < I(G). 


Proof Since H <1 <1G, there is a normal series, which we can take to be without 
repetitions, and in which H is a term. Since G has a composition series, it can be 
refined to a composition series of G. The subchain of this composition series starting 
from H onward is a composition series of H. Clearly, /(H) < 1(G). tt 


Example 9.4.30 An arbitrary subgroup of a group need not have a composition series 
even if the group has a composition series. Consider, for example, the group Ago 
which is simple (Proposition 6.2.43), and so, it has composition series. Let H be the 
subgroup of A. generated by the set X = {(4n 4n + 1)(4n +2 4n+3) |n © N}. 
Since X is infinite and elements of X commute, H is infinite abelian group. It follows 
that H has no composition series. 
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Now, we find a necessary and sufficient condition for a Q-group to have a Q- 
composition series. 

A Q-group G is said to satisfy descending chain condition (D.C.C.) for Q- 
subnormal subgroups if given any chain 


G= G{,>G2b-:-bG,bGrayF::-, 


where each G,,;; is normal Q2-subgroup of G,,, there exists ng € N such that G, = 
G,, for alln > no. 

A Q-group G is said to satisfy ascending chain condition (A.C.C.) for Q- 
subnormal subgroups if given any ascending chain 


H, Ay <.-- dH, IA, <--> 


of Q-subnormal subgroups of G, there exists m9 € N such that H, = 4, for all 
n> No. 

One notices that if G satisfies D.C.C. (A.C.C.), then any Q-subnormal subgroup 
of G and any (2-quotient group of G also satisfies D.C.C. (A.C.C.). 


Example 9.4.31 A finite Q-group satisfies D.C.C. as well as A.C.C., for we cannot 
have an infinite properly ascending or properly descending chain of (2-subgroups. 


Example 9.4.32 The additive group Z of integers satisfies A.C.C., for given any 
subgroup mZ of Z, mZ C nZ if and only if n is a proper divisor of m. Since there 
are only finitely many proper divisors of m, only finitely many subgroups of Z can 
contain m Z. It does not satisfy D.C.C., for we have an infinite properly descending 
chainZ > 2Z > 2Zb.---2"Zp>2"'1Zb..- of Z. 


Example 9.4.33 A group may satisfy D.C.C. but not A.C.C.: Let p bea prime integer, 
andQ, = Le | m € Z,r € NU{0}}. Then, Q, is a subgroup of the additive group 
of rationals, and since it is abelian, all its subgroups are subnormal. LetG = Q,/Z. 
We have an infinite properly ascending chain 
{Z} <J {L/L < pL/Z<---< L/L > 

of subgroups of G. Thus, G does not satisfy A.C.C. We show that it satisfies D.C.C. 
Let H be a subgroup of Q,, containing Z properly. Then, for some natural number 
m co-prime to p anda € N, oe e€ H —Z. By the Euclidean algorithm, there exist 
u,v € Zsuch that um + vp® = 1. Hence, = =vt+ ua € H. Thus, a e€ Hif 


=o 
Pp 


and only if — € H. Let ap be the least positive integer such that aa € H. Then, 


H = {ty |meZ} = pay Le 


Clearly, there are only finitely many subgroups of Q, contained in H. Thus, there 
are only finitely many subgroups of G contained in the subgroup H/Z of G. This 
means that G satisfies D.C.C. 
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Remark 9.4.34 Since Q, does not satisfy (as G does not satisfy) A.C.C., and since it 
is a subgroup of the additive group Q, the additive group Q of rationals also does not 
satisfy A.C.C. Observe that Q does not satisfy D.C.C. also (for otherwise Z would 
satisfy D.C.C). Clearly, the additive group of reals, the additive group of complex 
numbers, the multiplicative group of nonzero reals, the multiplicative group complex 
numbers, and the circle group do not satisfy any of the chain conditions. 


Theorem 9.4.35 A Q-group has a Q-composition series if and only if it satisfies 
both the chain conditions for Q2-subnormal subgroups. 


Proof Suppose that G has a (2-composition series, and /(G) = ng. Since every 
Q2-subnormal series without repetitions can be refined to a {2-composition series, 
and any two Q-composition series are equivalent, we cannot have a (Q2-subnormal 
series without repetitions which is of length greater than no. Thus, G satisfies D.C.C., 
for otherwise, we can extract a (Q2-subnormal series without repetitions of arbitrary 
length. G also satisfies A.C.C., for otherwise, we can extract a properly ascending 
chain 
A, <I Ay +++ d Any 4+1 


of 82-subnormal subgroups. Since H,,,+1 is a 82-subnormal subgroup of G, the above 
segment can be enlarged to a (Q2-subnormal series of G which is without repetitions, 
and whose length, therefore, is at least m9 + 1. This is a contradiction to the fact that 
there is no (2-subnormal series without repetitions of length greater than no + 1. 
Conversely, suppose that G satisfies A.C.C. as well as D.C.C. We first show that 
G has a maximal normal Q-subgroup. The trivial subgroup {e} may be maximal 
normal &2-subgroup. If not, there is a nontrivial proper normal (Q2-subgroup AH; of 
G. {e} < HM, <1 G. HM, may be maximal normal Q2-subgroup. If not, there is a normal 
Q-subgroup H2 of G such that {e} <4 H; <| Hy <1 G. Hy may be maximal normal Q2- 
subgroup. If not, proceed in the same way. This process stops after finitely many steps 
giving us a maximal normal 2-subgroup, because of A.C.C. Let Gz be a maximal 
normal (2-subgroup of G. If G2 = {e}, then G > {e} is a Q-composition series of G. 
Suppose that G2 ¢ {e}. Then, since every Q-subnormal subgroup of G also satisfies 
A.C.C., from the previous argument, G2 will have a maximal normal Q-subgroup G3 
(say). If G; = {e}, then G > G2 > {e} is a composition series of G. If not, proceed 
similarly. This process stops after finitely many steps giving us a (2-composition 
series, because of D.C.C. tt 


Taking Q = Inn(G), we get the following corollary: 


Corollary 9.4.36 A group has a principal series if and only if it satisfies both the 
chain conditions for normal subgroups. tt 


Exercises 


9.4.1 Find refinements of 
Z > 8Z > 48Z > {0} 
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and 
Zo 7Z > 42Z > {0} 


which are equivalent. 


9.4.2 Find refinements of 
RoQ+ V2Z> Zp {0} 


and 


RoQ + V3Zp> Zp {0} 


which are equivalent. 
9.4.3 List all composition series of Z12, and also find their length. 


9.4.4 Let <a > be acyclic group of order pj'' p5?... pe’, where pi, p2,.--, Pr 
are distinct primes. Find a composition series of G and show that its length is a; + 
Q2 + --- + a,. Find the number of composition series of G. 


9.4.5 Prove the fundamental theorem of arithmetic using the Jordan Holder Theo- 
rem. 


9.4.6 Suppose that H and G/H both satisfy A.C.C. (D.C.C.). Show that G also 
satisfies A.C.C. (D.C.C.). 


9.4.7 Let H <1 G. Show that G/H has a composition series if and only if following 
two conditions hold. 
(i) Given any chain 
A, HW,>-::-bHA,D-::: 


of subnormal subgroups of G containing H, there exists 9 € N such that H, = H,, 
for alln > no. 
(ii) Given any chain 


G=G,0G2b-::-b>G,zrb Gr b::: 
such that each G,,+; is normal in G,,, there exists ng € N such thatG,H = G,,H 
for alln > no. 


9.4.8 Show that none of the groups (i) (S', -), (ii) (R*, -) and (iii) (C*, -) satisfy 
any of D.C.C. or A.C.C. 


9.4.9 Let G be a group all of whose subgroups are subnormal subgroups. Then, 
show that G has a composition series if and only if it is finite. 


9.4.10 Let G and G’ be finite abelian groups such that /(G) = 1(G’). Show that 
|G|=|G'|. 
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9.4.11 Let H < <G and K a subgroup of G. Show that H () K (<iK. 


9.4.12 Show that the intersection of two subnormal subgroups is a subnormal sub- 
group. 


9.4.13 Let H and K be subgroups of G which have composition series. Suppose 
that HK = KH. Can we conclude that HK has a composition series? 


9.4.14 Show that G satisfies A.C.C. for subgroups if and only if every subgroup of 
G is finitely generated. 


9.4.15 Let G be a group which satisfies A.C.C. for subgroups. Show that G has a 
maximal subgroup. 


9.4.16 Show that the additive group Q of rationals, the additive group R of reals, 
the multiplicative group R* of nonzero reals, the multiplicative group R* of positive 
reals, the additive group C of complex numbers, the multiplicative group C* of 
nonzero complex numbers, the circle group S', and the group P of roots of unity 
have no maximal subgroups and none of them are finitely generated. 


9.4.17 Does the group of nonsingular n x n matrices over reals satisfy A.C.C.? 
Does it have a maximal subgroup? 


9.4.18 Does the group of n x n matrices of determinant | over reals satisfy A.C.C.? 
Does it have a maximal subgroup? 


9.4.19 Find all principal series, principal characteristic series, and also principal 
fully invariant series of S4. 


9.4.20 Show that a group which has a composition series has also a principal series. 
In turn, show that it also has a principal characteristic, and a principal fully invariant 
series. Is the reverse implication true? Support. 


9.4.21 Give an example of an indecomposable group which has no principal series. 


9.4.22* What can we say about union of a chain of subnormal subgroups? Is it always 
subnormal? 


9.4.23* Let H and K be subnormal subgroups of a finite group. Show that the sub- 
group generated by H |) K is subnormal. Can we conclude the result in infinite 
groups also? 


9.4.24* Let H and K be subnormal subgroups of G such that HK = KH. Show 
that HK is subnormal. 


Chapter 10 
Structure Theory Continued 


This chapter deals with the Remak-Krull-Schmidt Theorem on direct decomposition, 
structure theory of solvable and nilpotent groups together with the presentation theory 
of groups. 


10.1 Decompositions of Groups 


We refer Sect.5.2 of Chap.5 to recall the basic definitions and properties of direct 
products. In this section, we establish the Remak—Krull—Schmidt Theorem and study 
completely decomposable (in particular semisimple) groups. We also discuss other 
type of decompositions of groups. 


Theorem 10.1.1 (Remak—Krull—Schmidt) Let G be a group with operator Q which 
has a Q2-principal series (equivalently, it satisfies A.C.C and D.C.C for normal 
Q-subgroups). Then, G can be expressed as a direct product of finitely many Q2- 
indecomposable subgroups. Further, the representation of G as direct product of 
Q2-indecomposable subgroups is essentially unique in the following sense: If 


G = H, x H)x.:--x H, = K, x Ko x:-:- xX Kn, 


where H; and K; are 82-indecomposable subgroups of G, then 

(i)m=n, 

(ii) there is a central automorphism o of G such that after some rearrangement 
o(H;) = K; foralli, and 


G = Ki x K2x---x K, X Aya X Apen X +++ X Ay. 


forallr <n. 
In particular, we have the following corollary: 
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Corollary 10.1.2 If G is a group which has a principal series (characteristic series, 
fully invariant series), then G can be expressed as a direct product of finitely many 
indecomposable (characteristically indecomposable, fully invariantly indecompos- 
able) subgroups. Further, then, the representation of G as direct product of inde- 
composable (characteristically indecomposable, fully invariantly indecomposable) 
subgroups is essentially unique in the following sense: If 


G = A, x H)x:--x H, = K, x Ky x---X Kn, 


where H; and Kj; are indecomposable (characteristically indecomposable, fully 
invariantly indecomposable) subgroups of G, then 

(i)m=n, 

(ii) there is a central automorphism o of G such that after some rearrangement 
o(H;) = K; foralli, and 


G = Ki x K2x---x K, X Hyyy X Apen X +++ X Ay. 


forallr <n. ft 
We need some Lemmas to prove the Remak—Krull—-Schmidt Theorem. 


Lemma 10.1.3 Let G be a group with operator Q. Let o be a normal 
Q-endomorphism of G such that o°*(G) = o(G) and kero? = kero. Then, 
G = a(G) x kero. 


Proof Since o is anormal (2-endomorphism of G, 7(G) is a normal Q2-subgroup of 
G. Let g € G. Since o*(G) = o(G), there is an element h € G such that o(g) = 
o*(h). But, then, (a(h))-'g € kero. This shows that g € o(G)kero. Thus, G = 
a(G)kero. Let g € o(G) (| kero. Then, g = o(h) for some h € G. Also, e = 
a(g) = o*(h). Hence, h € kero? = kero. This shows thatg = o(h) = e. Thus, 
a(G)(\kero = {e}. From Corollary 5.2.25, it follows thatG = o(G) x kero. 


Lemma 10.1.4 (Fitting) Let G be a group with operator Q which has a Q-principal 
series. Let o be anormal (2-endomorphism of G. Then, there exists a natural number 
m such that o"(G) = o'(G) foralln >m,andG = o"(G) x kero”. 


Proof Since o is anormal (2-endomorphism of G, image of a normal Q-subgroup of 
G is again a normal Q-subgroup of G. In turn, 0”(G) is a normal Q-subgroup 
of G for all natural number n. Since G has a Q2-principal series, it satisfies 
D.C.C as well as A.C.C for normal Q-subgroup of G. Thus, the descending 
chain G D o(G) D o7(G) D......... Do"(G) Do" "(G) D...... , and the ascend- 
ing chain kero C kero © vee C kero” C.... eventually terminate after finitely 
many terms giving us a natural number m such that o"(G) = o”(G) and kero” = 
kero” for alln > m. Put p = o”. Then, p°(G) = 0?"(G) = o"(G) = p(G) 
and kerp? = kero?” = kero™ = kerp. From the previous lemma, G = 
p(G) x kerp = o"(G) x kero”. tt 
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Corollary 10.1.5 Let G be a Q-indecomposable group which has a Q-principal 
series. Let 0 be a normal {2-endomorphism of G. Then, either o is nilpotent (in 
the sense that for some m o(g) = e forall g € G), or else it is a central Q- 
automorphism of G. 


Proof From the Fitting Lemma, there is a natural number m such that G = 
a” (G) x kero”. Since G is indecomposable, 0"(G) = G and kero” = {e}, 
orelse 0” (G) = {e} and kero” = G. The result follows. tt 


Recall that two Q-endomorphism o; and o2 are summable if 7; + 2 is also an 
endomorphism (and so also, a (2-endomorphism) of G. Indeed, it is so if and only 
if the elements of 0;(G) commute with elements of o2(G). Further, if 0, and a2 are 
summable, and o is anendomorphisms, then cog, and 002 (a, 00 and 0200) are also 
summable, and go(o; + 02) = 000, + G002 ((0, + o2)00 = 0,00 + 0200). 


Corollary 10.1.6 Let G be a nontrivial Q-indecomposable group which has a Q- 
principal series. Let {0,, 02, ..., On} be a set of pairwise summable normal Q- 
endomorphisms. Further, suppose that {0, + 02 + -+++ oy} is an automorphism. 
Then, 0; is a central automorphism for some i. 


Proof By induction, it is sufficient to prove the result for n = 2. Let o; and a2 be 
summable normal {2-endomorphisms such thato = 0; + 02 is an automorphism. 
Suppose also that neither a; nor 02 is an automorphism. Then, from the above 
corollary, 0; and o are both nilpotent. As observed above Tt; = o7!oa, and7) = 
o~'oo is a pair of summable normal Q-endomorphisms such that 7; + = Ig. 
Also, since a; and 2 are nilpotent, and o is an automorphism, it follows that none 
of the 7; and 72 are automorphisms. As such, they are nilpotent. Let m be a natural 
numbers such that 7/” and 73” are trivial maps. Also, MoT; + 77 = TMolg = 
[got = T0T + 7. This shows that 7,072 = 7207,. We can use the bionomial 
theorem to conclude that (7, + 7)?” is also a trivial map. This is a contradiction, for 
T, +72 is the identity map on G. Hence, the supposition that neither a; nor a2 is an 
automorphism is false. ft 


Lemma 10.1.7 Let G be a Q-group which has a 2-principal series. Then, every 
normal injective (surjective) endomorphism of G is a central automorphism of G. 


Proof Let a be an injective normal (2-endomorphism of G. Then, o” is injective for 
each n. Since G has a Q-principal series, o”(G) = o”*t!(G) for some n. Let x € G. 
Then, there is a y € G such that o”(x) = o”t!(y). Since o” is injective, x = o(y). 
This shows that o is also surjective. Next, suppose that o is a surjective normal Q2- 
endomorphism of G. Then, o” is surjective for each n. Since G has a (2-principal 


series, kero" = kero"*! for some n. Suppose that (x) = e. Since o” is surjective, 
x = o"(y) forsome y € G. But, theno”t!(y) = e,andso y € kero"*! = kero". 
Hence, x = o”(y) = e. This shows that c is injective also. tt 


Proof of Theorem 10.1.1 Existence of decomposition: Let G be a group with operator 
set Q which has a (2-principal series. The proof of the existence of the decomposition 
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is on the induction of the length of the principal series. If the length of the Q principal 
series of G is 1, then it is Q2-simple, and so it is indecomposable. Assume that the 
result is true for all those Q-groups which have (2-principal series of length less 
than that of G. If G is Q-indecomposable, then there is nothing to do. Suppose that 
G is {2-decomposable. Suppose thatG = H x K, where H and K are nontrivial 
proper (2-normal subgroups of G. Clearly, H and K have {2-principal series of length 
less than that of G. By the induction hypothesis, H and K both are direct products 
of finitely many Q2-indecomposable groups. In turn, G can be expressed as direct 
product of Q2-indecomposable groups. 

Uniqueness of the decomposition: The proof is by the induction on max(m, n). 
Suppose that max(m, n) = 1. Then, since G is nontrivialm = 1 = n, and there 
is nothing to do. Assume the induction hypothesis. Suppose that 


G = H, x H)x:--x H, = K, x Ko x-:: xX Kn, 


where H; and K; are {2-indecomposable subgroups of G, and max(m, n) > 1. Let 
Pi, P2, ---» Pn be the projections corresponding to the decomposition 


G = HA, x Hy x:--x M, 
and qi, 92, ---, 4m that corresponding to the decomposition 
G = K, x Kyx:::X Kn. 


Then, {71, P2, ---, Pn} and {q1, qo, ---, Gm} are sets of pairwise summable normal 
and idempotent &2-endomorphisms such that 


Pit pote + Pn = Ie = i t+qat+ +++ +4, 


and 
piopj = 0 = qpog fori A jandk Al. 
Thus, 


m 


Pi = piop, = piolgop, = > progiop 
1 


Since p; restricted to H, is the identity map Jy, on H,, it follows that 


m 


Ty, = yn. 
1 


where 77; denote the restriction of p,;oq; to H,. Clearly, {7;|1 <i < m} is the set pair- 
wise summable normal (2-endomorphisms of H. It follows from Corollary 10.1.6 
that 7; is a central Q-automorphism of Hj for some 7. After rearranging, we may 
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assume that 7 is a central Q-automorphism of H;. As such, q; restricted to Hj is 
an injective Q-homomorphism from H to K;, and p; restricted to K; a surjective 
Q-homomorphism from K, to H; which commute with all induced inner automor- 
phisms of G on H; and K,, respectively. Let p; denote the restriction of gop, to 
K,. Then, p; is a normal Q2-endomorphism of K,. Now, 
Image pi 
= pi(K1) 
= (91 P11 P1)(K1) 
= (gipigi)(M) (for p; restricted to K; a surjective map from K, to HM) 
= qi(A)) (for piqi restricted to H; is an automorphism of A) 
= (gi pi)(K1) (for p restricted to K; a surjective map from K, to M;) 
= Image p. 
Also, 
ker ra 
={x € Ki | qipigipi(x) = e} 
= {x € Ki | (gipi(x) = e} (for pq restricted to H, is an automorphism) 
=ker py. 
From Lemma 10.1.3, it follows that K; = p,(K1) x kerp,. Since K, is indecom- 
posable, p\(K,) = K, and kerp, = {e}, orelse p;(K,) = {e}andkerp, = kK. 
Since p; restricted to Kj is surjective map from K, to Hj and q; restricted to Hy 
is injective, kerp; # K,. Thus, p;, which is the restriction of gijop; to Kj, is a 
normal Q2-automorphism of K,. This ensures that p; restricted to K, is an injec- 
tive Q-homomorphism from K, to Hj, and q; restricted to Hj is a surjective Q- 
homomorphism from H, to K;. Consequently, p; restricted to K, is a Q2Q-isomorphism 
from K, to Mj, and q; restricted to Hj is a Q2-isomorphism from H to K; which 
commute with all inner automorphisms of G. 

Now, consider the subgroup L = K,A>A.......... H,, of G. Clearly, K; and 
A Hy.......... H,, are normal subgroups of L. Let x € K, ()(HoH.......... H,,). Then, 
qi(pi(x)) = e. Since q; p; restricted to K,; is an automorphism of K,, it follows 
that x = e. Thus, K; (\(#oA.......... H,) = {e}. This shows that L is the direct 
product K, x Ay x Ay x... x H,,. It also follows that qip; + p2 + p3 + 
ssdengys + Pn is a normal injective endomorphism of G whose image is L. From 
Lemma 10.1.7, it follows that L = G. Clearly, qipi + po + p3 + «ee. + Pn 
is acentral automorphism of G which takes H; isomorphically to K,. It also induces 
Q-isomorphism from G/H, = Hp» x H3x........ x H, to G/K, = Hx Hz x 
naeeeees x Hy, = Ko x K3 X ........ X Km. The result follows by the induction. tt 


Exercises 

10.1.1 Show that S,,, A,, Qg, and Dg are indecomposable groups. 

10.1.2 Show that any extra special p-group is indecomposable. 

10.1.3 Show that Qg x Qg is not isomorphic to Qg x Ds. 

10.1.4 Express all groups up to order 15 as direct product of indecomposable groups. 


10.1.6 Is the additive group R of real numbers indecomposable? Support. 
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10.2 Solvable Groups 


Let G bea group. Recall that an element of the form aba~!b~! is called acommutator, 
and it is denoted by (a, b). Notice that (a, b)ba = aband (a,b) = eifandonly ifa 
and b commute. Recall further that the subgroup [G, G] generated by all commutators 
is called the commutator or the derived subgroup of G. Since (a, b)-! = (b,a)isa 
commutator, every element of [G, G] is a product of commutators. The commutator 
subgroup [G, G] is characterized (see Theorem 5.2.34 and Remark 5.2.35) by the 
property that it is the smallest normal subgroup of G by which if we factor we get 
an abelian group, or equivalently, G/G’ is the largest quotient group of G which is 
abelian. G/G’ is also called the abelianizer of G. Further, if a subgroup H contains 
G’, then it is necessarily normal. Clearly, G’ = {e} if and only if G is abelian. If G 
is a nonabelian simple group, then G’ = G. A group G is said to be a perfect if 
its commutator subgroup G’ is G itself. Thus, a nonabelian simple group is always 
perfect. Observe that the commutator subgroup (H x K)’ of the product H x K 
is the product H’ x K’ of their commutators. Thus, product of nonabelian simple 
groups is also perfect, but it is not simple. 


Remark 10.2.1 Product of two commutators need not be a commutator (refer to 
Examples 10.2.6 and 10.2.7). 


Let A and B be subsets of a group G. The subgroup generated by {(a, b) | a € 
A and b € B} is denoted by [A, B]. Since (a,b)~! = (b, a), [A, B] = [B, Al. 


Proposition 10.2.2 Let H and K be normal subgroups of G. Then, [H, K] = 
[K, H] is also anormal subgroup of G, and[H, K] C H() K. 


Proof Since {(h,k) |h € H and k € K} generates [H, K], it is sufficient to show 
that g(h,k)g7! € [H, K] for all g € G, h € H and k € K. This is evident, for 
gih,k)g-' = (ghg~', gkg~') (verify). Further, since H is normal in G,kh~!k7! € 
H, and so (h,k) € H. Thus, [H, K] C H. Similarly, [H, K] C K. tt 


Corollary 10.2.3 [G,G] IG. tt 


Example 10.2.4 Qg/{1, —1}, being a group of order 4, is abelian (in fact it is isomor- 
phic to V4). Thus, Q © {1, —1}. Further, since Qg is nonabelian Og # {1}. Hence, 
Q, = {1,-1}. 


Example 10.2.5 For n> 3, S! = A,: Since S,/A, © {1, —1} is abelian, S) Cc 
A,. Further, since product of any two transpositions is expressible as products of 
cycles of length 3, A,, nm > 3 is generated by cycles of length 3. Therefore, it is 
sufficient to observe that any cycle of length 3 is a commutator. Clearly, ((jk) = 


(GK GAY Gk). 


Example 10.2.6 Let F be a field. Let G denote the set consisting of triples (a(x), 
b(y), f(x, y)), wherea(x) = ae on b(y) = EF byy; are 3-linear expressions 
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(linear form) with coefficients a;, bj € F,and f(x,y) = Le ja Bj Ai; the bilinear 
expression (form) with coefficients a;; € F’. We define the binary operation - in G by 


(a(x), b(y), f(x, y)) - (c(x), dy), g@, y) = 
(a(x) + c(x), b(y) +d(y), f(®, y) +a(x)d(y) + g(, y)). 


It can be easily checked that G is a group with respect to this binary operation. Note 
that if F is infinite, then G is infinite. Now, assume that F is finite and contains 
q elements. Then, G contains g*q*q? = q' elements. Let H denote the set of 
pairs (a(x), b(y)), where a(x) and b(y) are the 3-linear expressions as explained 
above. Then, H is an abelian group with respect to coordinate wise addition. The 
map 7 from G to H given by 7((a(x), b(y), f(x, y)) = (a(x), bQ)) is a sur- 
jective homomorphism. Since H is abelian, [G, G] C ker 7 = {(0,0, f(x, y)) | 
f(x, y) is bilinear}. Indeed, we show that [G,G] = ker 7. By the definition 
of the product, it is evident that [(a(x), 0, 0), (0, b(y),0)] = (0,0,a(x)bQy)). 
It is also clear that (0,0, f(x, y))- (0,0, 9@, y)) = (0,0, f(x, y) + 9%, y)). 
Another basic fact is that for any bilinear expression f(x, y), there exist lin- 
ear forms a(x), b(y), c(x), d(y), u(x), v(y) such that f(x,y) = a(x)b(Qy) + 
c(x)d(y) + u(x)v(y). This shows that [G, G] = ker 7. Next, it is easily observed 
that [(a(x), b(y), fx. »)), (€®), dQ), 9, Y)] = O,0, a@)d(y) — e(x)b(y)). 
It follows that [G, G] contains exactly a elements, whereas the number of commu- 
tators is at the most q°. Thus, there are many products of commutators which are not 
commutators. 


Example 10.2.7 SL(n,R)’ = SL(n,R): The group SL(n, R) is generated by the 
set of all matrices of the types E } (transvections) (see Exercise 6.4.7). It is sufficient, 
therefore, to show that each E fe i # j isacommutator. Suppose thatn > 3. Letk ¢ 
i, k # j. Then, itcan be observed that [E},, Ej;] = Ej, i # j. Thus, SL(n, R)! = 
SL(n, R) for all n > 3. Now, assume that n = 2. Then, it can be checked that E om 
is the commutator [A, E},], where A is the matrix whose first row first column entry 
iS x/ 2. the second row second column entry is Jt. and the other entries are 0. 
Similarly, Ee is also a commutator. This shows that SL(2, R)’ = SL(2, R). Thus, 
—Ih €[SL(2, R), SL(2, R)]. It can be shown that —J, is not a commutator. This 
gives another example where product of commutators is not a commutator. 


Example 10.2.8 GL(n,R)’ = SL(n,R): Since GL(n, R)/SL(n, R) © R* (the 
determinant map from GL(n, R) to R* is a surjective homomorphism whose kernel 
is SL(n, R)) is abelian, GL(n, R)’ C SL(n, R). Also, SL(n,R) = SL(n,R)' C 
GL(n, R)’. 


Proposition 10.2.9 Let f be a surjective homomorphism from H to K. Then, 
f(A’) = K". 


Proof Since f ((a, b))=f (aba~'b~')= f (a) f (b) f (a)! f(b) '=(f (a), f)), it 
follows that f(H’) C K’. Further, let (c,d) be a commutator in K. Since f is 
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surjective, there exist a,b € H such that f(a) = c and f(b) = d. But, then 
(c,d) = ({@, f(b) = f(a,b)) € f(A’). Thus, all commutators of K are in 
f(H’), and so K’ C f(A’). 


Corollary 10.2.10 Let H < G. Then, (G/H)' = G'H/H. 


Proof The quotient map v from G to G/H is a surjective homomorphism, and so, 
(G/H) = vV(G’) = GA/H. tt 


Definition Let G be a group. Define subgroups G” of G inductively as follows: 
Define G' = G’. Assuming that G" has already been defined, define G’*! = 
[G”", G"]. Thus, we get a series 


G= GeGoieG@eo...cepg'eGc"'s... 


of G. This series is called the commutator Series or the derived Series of G, and 
G" is called the nth term of the derived series. 


Remark 10.2.11 It follows by induction and Proposition 10.2.2 that each G” is a 
normal subgroup of G. In fact, it follows by induction and the fact that f((a,b)) = 
(f (a), f (b)), that each G” is fully invariant subgroup in the sense that it is invariant 
under all endomorphisms of G. 


Definition 10.2.12 A group G is said to be a solvable group (or soluble) if the 
derived series of G terminates to {e} after finitely many steps. The smallest n such 
that G’ = {e} is called the derived length of G. 


If H is a subgroup of G, then it follows by induction that H” C G" for all n. 
Thus, we have the following proposition: 


Proposition 10.2.13 Subgroup of a solvable group is solvable. tt 


If f is a surjective homomorphism from H to K, then again, by induction, it 
follows that f(H”) = K” for all n. Thus, we have the following proposition: 


Proposition 10.2.14  Homomorphic image of a solvable group is solvable. In par- 
ticular, quotient group of a solvable group is solvable. tt 


Following is a necessary and sufficient condition for a group to be solvable. 


Proposition 10.2.15 A group G is solvable if and only if it has a normal series with 
abelian factors. 


Proof Suppose that G is solvable and G” = {e}. Then, 
GSO@rqgr.bG" = fe} 


is a normal series such that all terms are commutator subgroups of the preceding 
terms, and so all factors are abelian. Conversely, suppose that G has a normal series 
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G= Go&G,;bG2E::-Gy = {e} 

all of whose factors are abelian. We show, by induction, that G' CG; for all i. 

Clearly, G° = G = Gp. Assume that G! C G;. Then, since G; /Gj+, is abelian, 

(Gi) © G41. But, then G'+! = (G')' C G © Gi41. Thus, G” CG, = {e}, and 

so G” = {e}. This shows that G is solvable. tt 


Proposition 10.2.16 Let G be a group and H a solvable normal subgroup of G such 
that G/H is solvable. Then, G is also solvable. 


Proof Since G/H is solvable, it has a normal series 


G/H = Go/H&G|/H2G2/AHC.:--6G,/H = {HA} 


with abelian factors. Further, since H is solvable, it has a normal series 
H = Wy PW >---> Hn = {e} 
with abelian factors. Consider the normal series 
G= GEG EGE::-G,=H=AHEME-:--b An = fe} 


of G. Since (G;/H)/(Gi+41/H) © G;/G;+, (firstisomorphism theorem), each factor 
of the above normal series is abelian, and hence, G is solvable. tt 


Proposition 10.2.17 Let H and K be solvable subgroups of a group G. Suppose 
that K 1G. Then, HK is a solvable subgroup. 


Proof Since K 1G,HK = KH, and so HK is a subgroup of G. Since H is 
solvable, H/H (| K is solvable. By the Noether second isomorphism theorem, 
H/H()K is isomorphic to HK/K. Hence, HK/K and K are solvable. From 
the previous proposition, H K is solvable. ft 


Corollary 10.2.18 A maximal solvable normal subgroup of a group (if exists) is the 
largest solvable normal subgroup. 


Proof If M is a maximal solvable normal subgroup and H a solvable normal sub- 
group, then by the above proposition, H M is also a solvable normal subgroup. Since 
M is supposed to be maximal, HM = M,andso H C M. tt 


Corollary 10.2.19 Every finite group has the largest solvable normal subgroup. 


The largest solvable normal subgroup of a group G (if exists) is called the radical 
of G, and it is denoted by R(G). Observe that radical of G/R(G) is trivial. A group 
G is called semisimple if R(G) = {e}. Thus, in the theory of groups, it is important 
to study solvable groups and also semisimple groups. There is a nice structure theory 
of semisimple groups and also of solvable groups. 
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Remark 10.2.20 Product of arbitrary solvable subgroups need not be solvable, even 
if HK = KH: Consider the alternating group As of degree 5. The subgroup A4 of 
permutations in As fixing the symbol 5 is a solvable subgroup (A, = V4, and V; = 
{e}). Let P be a Sylow 5-subgroup of As. Then, P, being cyclic, is solvable (P’ = 
{e}). Clearly, As = A4P is not solvable (since As is nonabelian simple, AS = As, 
and so the derived series will never terminate to {e}). 


Proposition 10.2.21 Let G # {e} be a solvable group. Then, G has a nontrivial 
abelian normal subgroup. 


Proof Since G 4 {e} and it is solvable, there exists m € N such that G” 4 {e} and 
Gt! = {e}. But, then G” is nontrivial abelian normal subgroup of G. tt 


Corollary 10.2.22 A solvable group is simple if and only if it is a cyclic group of 
prime order. 


Proof Assume that G is solvable simple group. If G is a solvable nontrivial group, 
then G’ 4 G, for otherwise all terms of the derived series would be G, and so it will 
never terminate to {e}. Since G is assumed to be simple, G’ = {e}, and so G is 
abelian. Since an abelian simple group is prime cyclic, the result follows. ft 


Corollary 10.2.23 A solvable group has a composition series if and only if it is 
finite. 


Proof Suppose that G is solvable and has a composition series. Since subgroups 
and quotient groups of solvable groups are solvable, it follows that all composition 
factors of G are solvable as well as simple. But, then they are prime cyclic (above 
proposition). Hence, the group G is finite. The converse is immediate. tt 


Now, we give some examples of solvable groups. Every abelian group G is solv- 
able, for then, G' = G’ = {e}. $3 is solvable, for 53 > A3 > {7} is a normal 
series of $3 with abelian factors. S4 is also solvable, for Sy > Ag > V4 > {J} is a 
normal series of S4 with abelian factors. The Quaternion group Qg is solvable, for 
Qs > {1, —1, 7, —i} > {1} is a normal series of Qg with abelian factors. 

Sr, nm = 5 is not solvable, for S, > A, > {7}, and S, > {7} are the only normal 
series, and none of them are with abelian factors. 


Example 10.2.24 Every group of order pq is solvable, where p and q are primes: 
If p = q,then| G|= p’, and so it is abelian. Hence, it is solvable. Suppose that 
p#q,and p > q. Then, the Sylow p-subgroup P of G is normal. Clearly, P and 
G/P (being prime cyclic) are solvable. Hence, G is solvable. 


Example 10.2.25 Every group of order 100 = 5? - 27 is solvable: Sylow 5-subgroup 
of G is normal (Sylow 2nd theorem), and also solvable (being abelian). Also G/P, 
being abelian, is solvable. Hence, G is solvable. 
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Example 10.2.26 Every group of prime power order is solvable: Let | G | = p”. 
The proof is by induction on n. If nm = 1, then G is prime cyclic, and so solvable. 
Assume that the result is true for all groups of orders p” , m <_n. Since G is prime 
power order, Z(G) # {e}. Hence, | G/Z(G) | = p™” for some m < n. By the 
induction hypothesis, G/Z(G) is solvable. Since Z(G) (being abelian) is solvable, 
G is solvable. 


Example 10.2.27 Let G bea group of order pgr, where p, g andr are distinct primes 
with p < q <r. Then, G is solvable: We know that the Sylow r— subgroup R of G 
is normal. R, being prime cyclic, is solvable. Further, | G/R |= pg, andsoG/R 
is also solvable. Hence, G is solvable. 


Remark 10.2.28 Fromaresult of P. Hall, it follows that all C.L.T. groups are solvable. 
But, the converse is not true: For example, A, is solvable, and it has no subgroup of 
order 6. 


Now, we state some results (without proofs) for the sake of information to the 
readers. 


Theorem 10.2.29 If the order of a group is product of distinct primes, then it is 
solvable. tt 


Theorem 10.2.30 Every group of order pq" is solvable. tt 
The proof of this theorem can be found in Algebra 2, Chap. 9. 

Theorem 10.2.31 Every group of odd order is solvable. tt 
Obviously, groups in the above theorems are nonsimple groups. 


Remark 10.2.32 The last theorem was conjectured by Burnside in the beginning of 
the 20th century, and it was proved by W. Fiet and J.G. Thompson in 1963. Since every 
nonabelian simple group is nonsolvable, the theorem ensures that every nonabelian 
finite simple group is of even order. Consequently, every nonabelian finite simple 
group contains an involution (an element of order 2). One of the main problems in 
theory of finite group was the classification of finite simple groups. The period from 
1950 to 1970 was very crucial for the classification problem. Brauer, Fowler, Janko, 
and their students showed that the problem of determining finite simple groups with 
given centralizer of an involution is tractable. A two major steps’ strategy for the 
classification was adopted. 

Step1. Determine the possibilities for the centralizers of involutions in a finite simple 
group. 

Step2. Determine all finite simple groups with a given centralizer of an involution in 
the group. 

The development during this period was greatly influenced by the works of Janko, 
Suzuki, Thompson, Feit, Glauberman, Ree, and others. The classification of finite 
simple groups was complete in early eighties. 
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Following is a generalization of Sylow theorems for finite solvable groups: Let 
a be a set of prime numbers. A number m is said to be 7-number, if every prime 
dividing m belongs to 7. It is said to be a 7’ number, if no prime number dividing m 
is in 7. A finite group G is said to be a 7-group, if | G | is a 7 number. A subgroup 
H of a finite group G is said to be a Hall 7-subgroup of G if it is a 7-subgroup of 
G, and [G, H] is az’ number. 


Theorem 10.2.33 (P. Hall) Let G be a finite solvable group, and 7 a set of prime 
divisors of G. Then, G has a Hall r-subgroup. Further, any two Hall t-subgroups 
are conjugates. ft 


Exercises 


10.2.1 A subset A of a group G is said to be a normal (fully invariant) subset, if it is 
invariant under all inner automorphisms (endomorphisms). Let A and B be normal 
(fully invariant) subset of G. Show that [A, B] is anormal (fully invariant) subgroup 
of G. 


10.2.2 Let G be a group, and f a group homomorphism from G to an abelian group 
A. Show that there exists a unique homomorphism f from G/ G’ to A such that 
fG’) = f(@) forall x € G. 


10.2.3 Find the commutator subgroups of the alternating groups. 
10.2.4 Show that every finitely generated group has a radical. 
10.2.5 Show that every group of order 7? - 4 is solvable. 


10.2.6 Show, by means of an example, that a subgroup of a solvable group need not 
be subnormal. 


10.2.7 Find commutator subgroup of a nonabelian group of order pg, where p and 
q are primes with p > q. 


10.2.8 Show that every group of order 53 - 6 is solvable. 
10.2.9 Show that any group whose order is less than 60 is solvable. 


10.2.10 Show, by means of an example, that a product of solvable subgroups need 
not be solvable. 


10.2.11 Show that every group of order p"q is solvable, where p, g are primes. 


10.2.12 Show that if every subgroup of G is normal (such groups are called 
Dedekind groups), then G is solvable. 
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10.2.13 Prove the following identities known as Witt-Hall identities. 

(i) (a, b)- (b,a) = e. 

(ii) (a, bc) = (a, b)(a, c)((c, a), b). 

(iii) (ab, c) = (b,c)((c, b), a)(a, b). 

(iv) ((b, a), c*) - ((a, c), B®) - ((c, b), a”) = e. 

(v) (0, a), c)- ((c, b), a) « ((a,c), b) = (b, a)(c, a)(c, b)*(a, b)(a, c) (b, c)“(a, ©) 
(c, a)’. 


for alla, b,c € G, where a? 


means bab™'!. 


10.2.14 Let T (n, R) denote the set of alln x n nonsingular upper triangular matrices 
with real entries. Show that T(n, R) is a group with matrix multiplication. Interpret 
the members of the commutator subgroup of T (n, R). Show that T (1, R) is solvable. 
Find its derived length. 


10.2.15 Find the radical of GL(n, R) and also of S$L(n, R). 


10.3 Nilpotent Groups 


Let G be a group. Define subgroups L,,(G) inductively as follows: Define Lo(G) = 
G, L\(G) = [G,Lo(G)] = [G,G] = G’. Supposing that L,(G) has already 
been defined, define L,,;(G) = [G, L,(G)]. It follows from Proposition 10.2.2 
that each L,,(G) is normal (in fact, fully invariant) in G. Thus, we get a descending 
chain 

G= L(GEL (GE LA(G)C---EL,(G)E--- 


of normal subgroups of G. This chain is called the lower central series of G. 
Since [53, $3] = A3 = [S3, As], 


S3 = S83 & Az & AZ --- 
is the lower central series of $3. Similarly, 

Sg = Sy b Ay & Age :-- 
is the lower central series of S4. Note that the lower central series, in general, is 
different from the derived series. 

Next, define normal subgroups Z,(G) of G inductively as follows: Define 
Zo(G) = {e} ,Z:(G) = Z(G). Observe that Z1(G)/Zo(G) = Z(G/Zo(G)) 
is the center of G/Zo(G). Supposing that Z,,(G) has already been defined, define 
Zn+1(G) by the equation 

Zn+1(G)/Zn(G) = Z(G/Z,(G)). 


Thus, we get an ascending chain 
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{e} = Zo(G) J Z\(G) I Z2(G) J --- I Z,(G)<--- 


of normal subgroups of G. This chain is called the upper central series of G. 
Since the center of S,, is trivial for all n > 3, the upper central series of S,, is 


fe} fe} St fe} += 


for alln > 3. 


Remark 10.3.1 It follows by induction that the terms of the lower central series are 
fully invariant, whereas the terms of upper central series are characteristic subgroups. 
Note that center, in general, is not a fully invariant subgroup. 


Theorem 10.3.2 The lower central series of G terminates to {e} at the nth step if 
and only if the upper central series terminates to G at the nth step (i.e., L,(G) = {e} 
if and only if Z,(G) = G). 


Proof Suppose that L,(G) = {e}. We show, by induction on 7, that L,-;(G) C 
Zi(G) foralli. Fori = 0, Ly_o(G) = Ly(G) = {e} = Zo(G), and the result is true 
for 0. Assume that Ln_i(G) © Z;(G). We show that L,_«+1)(G) © Zi41(G). Leta € 
£,3-4(G), Since Ly 4(G) = [6,15 4-4(G)], xax ae € Ey _(G) © 7(G) ior 
all x € G. This shows that xZ;(G)aZ;(G) = aZ;(G)xZ;(G) for all x € G. Thus, 
aZ;(G) belongs to the center of G/Z;(G). By the definition a € Z;+;(G). Putting 
i = n, we find that Z,(G) = G. 

Conversely, suppose that Z,(G) = G. We show, by induction, that L;(G) C 
Zn—i(G) for alli. Fori = 0, Lo(G) = G = Z,,(G), and the result is true for 
i = 0. Assume that L;(G) C Z,_;(G). We show that L;.;(G) C Z,_;-;(G). From 
the definition of L;,;(G), it suffices to show that xax~!a~! € Z,_;-,(G) for all 
x € Gand a €L;(G). Leta € L;(G). Since L;(G) is assumed to be contained in 
Zn-i(G), a € Z,-;(G). But, then by the definition of Z,_;, a@Z,-;-1(G) belongs 
to the center of G/Z,_;-1(G). Thus, xZ,-;-1(G)aZ,_j-1(G) = aZp,~j-1(G) 
xZ,—-i-1(G) for all x € G. This means that xa(ax)! € Z,_i-1(G) for all x € G. 
This completes the proof of the fact that L;(G) C Z,_;(G) for alli. Puttingi = n, 
we get that L,(G) C Zo(G) = fe}. tt 


Definition 10.3.3 A group G is said to be nilpotent if L,(G) = {e}, or equivalently, 
Z,(G) = Gforsomen. A group G is said to be nilpotent of class nif L,(G) = {e} 
but L,-1(G) & {e}, or equivalently, Z,(G) = G but Z,_|(G) £G. 


Proposition 10.3.4 Every nilpotent group is solvable. 


Proof The result follows from the simple observation (do it by induction) that the 
nth term G” of the derived series of G is contained in L,(G) the nth term of the 
lower central series of G. Thus, if L,(G) = {e} forsomen, then G” = {e}. tt 


Proposition 10.3.5 Subgroup of a nilpotent group is nilpotent. Homomorphic image 
(and so also quotient group) of a nilpotent group is also nilpotent. 


10.3 Nilpotent Groups 367 


Proof Let H be a subgroup of a nilpotent group G. It follows, by induction, that 
L,;(H) C L;(G) for all i. Thus, if L,(G) = {e}, then L,(H) = {e}. Further, if 
f is a surjective homomorphism from H to K, then it follows, by induction, that 
f(L£,(A)) = L,(K) for all n. The result follows. tt 


Proposition 10.3.6 Direct product of finitely many nilpotent groups is nilpotent. 


Proof It is sufficient to prove that the direct product of two nilpotent groups is 
nilpotent. Let H and K be groups. It follows, by induction, that L,(H x K) = 
L,(A) x L,(K). Thus, if L,(H) = {e} and L,(K) = {e}, then L,(H x K) = 
{(e, e)}, where k = max(m.n). tt 


Proposition 10.3.7 Let G be a nontrivial nilpotent group. Then, Z(G) 4 {e} (thus, 
in a nontrivial nilpotent group, there is an element different from identity, which 
commutes with each element of the group). 


Proof Suppose that Z(G) = {e}. Then, Z;(G) = Z(G) = {fe}. Since Z2(G)/ 
Z(G) = Z(G)/{e} = Z(G/Z\(G)) = Z(G/{e}) = Z(G)/{e} = {e}/{e}, 
Z2(G) = {e}. Proceeding inductively, we see that Z,(G) = {e} for all n. Hence, 
Z(G) can never be G, and so G is not nilpotent. tt 


Remark 10.3.8 A solvable group need not be nilpotent: $3 is solvable. Since 
Z(S3) = {J}, S3 is not nilpotent. Further, unlike the case of solvable groups, even 
if a normal subgroup H and the quotient group G/H are nilpotent, G need not be 
nilpotent: A3 is normal nilpotent subgroup of $3 such that $3/A3 ~ {1, —1} is also 
nilpotent, but $3 is not nilpotent. 


Proposition 10.3.9 Let H be anormal subgroup of G contained in the center Z(G) 
of G such that G/H is nilpotent. Then, G is nilpotent. 


Proof Suppose that G/#H is nilpotent. By the first isomorphism theorem, G/Z(G) ~ 
(G/H)/(Z(G)/H). Since quotient of a nilpotent group is nilpotent, G/Z(G) 
is nilpotent. We show, by induction, that Z;(G/Z(G)) = Zj41(G)/Z(G) for 
all i. Zo(G/Z(G)) = {Z(G)} = Z(G)/Z(G) = Z\(G)/Z(G). Assume that 
Zi(G/Z(G)) = Zi41(G)/Z(G). Then, 


Zi41(G/Z(G))/Z(G/Z(G)) = Z((G/Z(G))/Z;(G/Z(G))) = 
Z((G/Z(G))/(Zi+1(G)/Z(G))). 


Further, we know that (G/Z(G))/(Zj41(G)/Z(G)) © G/Z;+1(G), and the canon- 
ical isomorphism ¢@ is given by 


P(aZ(G) - (Zi41(G)/Z(G)) = aZj+1(G)). 


The center Z(G/Z;+1(G)), by definition, is Z;42(G)/Zj+1(G). Also, 


Z(G/Z(G))/(Ziz1(G)/Z(G))) = $'(Zi42(G)/Zins(@) = 
(Zj+2(G)/Z(G))/(Zi41(G)/Z(G)) = (Zi+2(G)/Z(G))/Z)(G/Z(G)). 
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Hence, Z;41(G/Z(G)) = Zj42(G)/Z(G). By the principle of induction, it fol- 
lows that Z;(G/Z(G)) = Zj4:(G)/Z(G) for all i. If G/Z(G) is nilpotent, then 
Zni1(G)/Z(G) = Z,(G/Z(G)) = G/Z(G) forsomen. But, then Z,.;(G) = G, 
and so G is nilpotent (the class of nilpotency of G is one more than that of G/Z(G)). £ 


Corollary 10.3.10 G is nilpotent if and only if G/Z(G) is nilpotent. tt 
Corollary 10.3.11 Every finite p-group is nilpotent. 


Proof Every finite p-group is of order p” for some n. The proof is by induction on 
n. If | G|=_ p, then G is prime cyclic, and so it is nilpotent. Assume that every 
group of order p” , m <_ n is nilpotent. Let G be a group of order p”. Then, 
Z(G) # {e}. Hence, | G/Z(G) | = p” , m < _n. By the induction assumption, 
G/Z(G) is nilpotent, and so from the above proposition, G is nilpotent. tt 


Remark 10.3.12 Aninfinite p-group need not be nilpotent. In fact, there are uncount- 
ably many groups, known as Tarski groups, which are infinite 2-generator simple 
groups all of whose proper subgroups are of same prime (sufficiently large) order. 


Corollary 10.3.13 [f all Sylow subgroups of a finite group are normal, then the 
group is nilpotent. 


Proof Tf all Sylow subgroups are normal, then it is direct product of its Sylow sub- 
groups. Since prime power order groups are nilpotent (above corollary), and the 
direct product of nilpotent groups are nilpotent, the result follows. ft 


Proposition 10.3.14 Let H and K be normal subgroups of G such that G/H and 
G/K are nilpotent groups of classes a and 3, respectively. Then, G/(H (\ K) is a 
nilpotent group of class at most max(q, (3). 


Proof Define a map 7: G —> G/H x G/K by n(x) = (xH,xK). Then, 7 is a 
homomorphism with kernel H (| K. By the fundamental theorem of homomorphism, 
G/H () K is isomorphic to a subgroup of G/H x G/K. Since product of nilpotent 
groups is nilpotent, and subgroup of a nilpotent group is nilpotent, it follows that 
G/H () K is nilpotent of class at most max(a, /3). tt 


Proposition 10.3.15 Let H be anormal nilpotent subgroup of G of class a, and K 
anormal nilpotent subgroup of class 3. Then, H K is nilpotent group of class at most 
at 6. 


Proof The proof is by induction ona+ 6.Ifa+ 6 = 1, thena = 1, 6 = 0 
ora = 0, @ = 1.Ifa = land = 0, then K = {e} and HK = H 
is already nilpotent of class a = a+0. Similarly, if a = 0, then H = {fe} 
and HK = K is nilpotent of class 3. Suppose that the result is true in all cases 
when the sum of nilpotency classes is less than a + 3. Let H and K be normal 
nilpotent subgroups of classes @ and (3, respectively. Since H is a normal sub- 
group of G , Z(#), being characteristic subgroup of H, is normal in G. Similarly, 
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Z(K) is also normal in G. Clearly, H/Z(H) and K Z(H)/Z(#) are normal sub- 
groups of G/Z(#). Further, H/Z(#) is a nilpotent group of class at most a — 1 
and K Z(H)/Z(H) © K/K () Z(4A) is nilpotent of class at most (3. Hence, by the 
induction hypothesis, H/Z(H)- K Z(H)/Z(H) = HK/Z(A) is nilpotent of class 
at most a+ 3 — 1. Similarly, HK /Z(K) is nilpotent of class at most a+ (3 — 1. 
Hence, HK/Z(H) x HK/Z(K) is nilpotent of class at most a + 3 — 1. From the 
previous proposition H K /Z(H) (\ Z(K) is nilpotent of class at most a+ 3 — 1. 
Since Z(H) () Z(K) C Z(HK), it follows that HK is nilpotent of class at most 
a+ (3. The result follows by the principle of induction. ft 


Remark 10.3.16 Product of a normal nilpotent subgroup with a nilpotent subgroup 
need not be nilpotent: The group $3 is not nilpotent, whereas it is product of the 
normal nilpotent subgroup A3 and the nilpotent subgroup {/, (12)}. But, it is true (a 
result of Wielandt) that product of any two nilpotent subgroups (if it is a subgroup) 
is solvable. Observe that product of solvable subgroups need not be solvable. For 
example, As = A4H, where H is the subgroup of A; generated by a cycle of length 
5. Clearly, Ay and H are solvable, whereas As is not solvable. 


Corollary 10.3.17 Maximal normal nilpotent subgroup is the largest nilpotent 
normal subgroup. 


Proof If H is a maximal normal nilpotent subgroup, and K is a normal nilpotent 
subgroup, then by the above proposition, H K is also normal and nilpotent. Since H 
is maximal, HK = H,andsoK CH. tt 


Definition 10.3.18 The largest normal nilpotent subgroup (if exists) is called the 
Fitting subgroup of G, and it is denoted by Fit(G). 


Let H be a subgroup of a group G. Define a sequence {NG(H) | n € NU {0}} 
of subgroups of G inductively as follows: Define NO (H ) = H. Assuming that 
NG (4) has already been defined, define NOt) = Nc(NG(4A)). Thus, we get 
an ascending chain 


H = NG(H)ING(A) A--- I NG(A) 


Proposition 10.3.19 Let G be a nilpotent group of class n. Then, NG(H) = G for 
all subgroups H of G. 


Proof Suppose that G is nilpotent of class n. Then, Z,(G) = G. We show, by 
induction on r, that Z,(H) C NG(#) for all r. Zo(G) = {e} CH = Ne(A). 
Assume that Z(G) C NG(H). Let a € Z,41(G). Then, aZ,(G) € Z(G/Z,(G)). 
Hence, aZ,(G)xZ,(G) = xZ,(G)aZ,(G) for all x € G. But, then axa~!x-! € 
Z(G), and soaxa~'x~! € N@(H) forall x € G. In particular, axa~' € NG(H) for 
all x € NG(#). This shows that a € Ng(NG(A)) = Not! (A). By the principle of 
induction, Z,(G) C NG(#) for all r. In particular,G = Z,(G) C Né(A), and so 
NG(A) = G. ft 
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Definition 10.3.20 A group G is said to satisfy the normalizer condition if every 
proper subgroup is properly contained in its normalizer. 


Corollary 10.3.21 Every nilpotent group satisfies normalizer condition. 


Proof Let G be a nilpotent group of class n. Then, N6(H) = G for all subgroups 
H. If H is a proper subgroup which is not properly contained in Ng(#), then 
No(H) = H,andso NG(H) = H # G. This is a contradiction. t 


Proposition 10.3.22 If a finite group satisfies normalizer condition, then all its 
Sylow subgroups are normal. 


Proof Suppose that G satisfies normalizer condition. Let P be a Sylow p-subgroup 
of G. By the illustration 9.2.6, Ng(NGg(P)) = NG(P). Since the group satisfies the 
normalizer condition, Ng(P) cannot be a proper subgroup of G. Thus, Ng(P) = G, 
and so P is normal in G. t 


Combining above results, we obtain the following corollaries. 


Corollary 10.3.23 A finite group is nilpotent if and only if all its Sylow subgroups 
are normal in G. tt 


Corollary 10.3.24 A finite nilpotent group is direct product of its Sylow subgroups. 
Thus, finite nilpotent groups are precisely products of prime power ordered groups. 
Corollary 10.3.25 A finite nilpotent group is a C.L.T. group. 


Proof Since a finite p-group is a C.L.T. group and direct product of C.L.T. groups 
is C.L.T. groups, the result follows from the above corollary. ft 


Remark 10.3.26 A C.L.T. group need not be nilpotent. For example, $3 is C.L.T. 
group but it is not nilpotent. However, a C.L.T. group is always solvable. 


Theorem 10.3.27 (Wielandt) A finite group is nilpotent if and only if all its maximal 
subgroups are normal. 


Proof Let G be a finite nilpotent group and M a maximal subgroup of G. Then, 
since it satisfies normalizer condition, M is properly contained in Ng(M). Since 
M is maximal, Ng(M) = G, and so M is normal. Conversely, suppose that all 
maximal subgroups of G are normal. We show that all Sylow subgroups of G are 
normal. Let P be a Sylow p-subgroup of G. Suppose that Ng(P) # G. Since G is 
finite, there is a maximal subgroup M containing Ng(P). But, then NG(M) = M, 
a contradiction to the supposition that all maximal subgroups are normal. Thus, all 
Sylow subgroups are normal, and so G is nilpotent. tt 


10.3 Nilpotent Groups 371 


Frattini Subgroup 


Let G be a group. If G has no maximal subgroups, then we define the Frattini 
subgroup of G to be G itself. If G has maximal subgroups, then the Frattini 
subgroup of G is defined to be the intersection of all maximal subgroups of G. 
Thus, if we agree to the convention that intersection of an empty family of subsets 
of a set X is X itself, then the Frattini subgroup is the intersection of all maximal 
subgroups of G. We use the notation ®(G) for the Frattini subgroup of G. Thus, if 
M denotes the family of all maximal subgroups of G, then ®(G) = ()yey H. 

Let G bea group anda € Aut(G). Let H beasubgroup of G. Then, H is maximal 
if and only if a@( 1) is maximal. In fact, a induces a bijective map from M to itself 
given by H ~ a(/7/). Further, since a is bijective it preserves intersection, and so 
a(®(G)) = &(G). Hence, the Frattini subgroup ®(G) is a characteristic subgroup. 
In particular, it is also normal. 


Example 10.3.28 Maximal subgroups of Z are of the form pZ, where p is a prime. 
Since there is no integer except 0 which is multiple of all primes, it follows that the 
intersection of all maximal subgroups of Z is {0}. Hence, ®(Z) = {0}. 


Example 10.3.29 Maximal subgroups of Qs are 
(1, —1, 4, —i}, (1, -1, 7, —J}, (1, -1, &, —k}, and so (Qs) = {1, —]}. 


Example 10.3.30 ®(S3) = {I}, for A3 and {/, (12)} are maximal subgroups of $3. 
@®(S4) = {J}, for Ay and V4 are the only proper nontrivial normal subgroups, and 
they are not contained in all maximal subgroups. Also ®(S,,) = {7}, n > 5, for the 
only nontrivial proper normal subgroup of S, is A,, and A, is not contained in all 
maximal subgroups (for example, it is not contained in S,,_1). 


Theorem 10.3.31 (Wielandt) A finite group G is nilpotent if and only if G’ C ®(G). 


Proof Let G be a finite nilpotent group and H a maximal subgroup. Then, from the 
previous theorem of Wielandt, H is normal subgroup of G. Since H is maximal, 
G/H is a group without proper subgroups, and so it is prime cyclic. In particular, it 
is abelian. But, then G' C H. This shows that G’ C ®(G). Conversely, suppose that 
G’ © ®(G). Then, every maximal subgroup of G contains G’, and so every maximal 
subgroup is normal. By the previous theorem of Wielandt, G is nilpotent. tt 


Now, we characterize elements of the Frattini subgroup ®(G). 


Definition 10.3.32 An element x € G is called a nongenerator of G if whenever 
a set S generates G, S — {x} also generates G. 


Thus, a nongenerator has essentially no role in generating the group. e is always 
a nongenerator, for any subgroup will contain e. 0 is the only nongenerator of Z: 1 
cannot be a nongenerator, for < 1 > = Z but < J >¥ Z. Similarly, —1 cannot be 
a nongenerator. Let m € Z,m #0,|m|> 1. Then, there exists n A +1 such that 
(m,n) = 1. But, then < {m,n} >= Zand < {n} > = nZ £Z. 

1 and —1 are the only nongenerators of Q(verify). 
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Theorem 10.3.33 The Frattini subgroup ®(G) is the set of all nongenerators of G. 


Proof Let X denote the set of all nongenerators of G. We have to show that ®(G) = 
X. Suppose that x ¢ ®(G). Then, there exists a maximal subgroup H of G such that 
x ¢ H.But, then < H U{x} >= Gand< H >= H 4G. This shows thatx ¢ X, 
and so X C ®(G). Next, suppose that x ¢ X. Then, there exists a subset S of G such 
thatx ¢ S, <SU{x} >= Gand<S>4G.LetH =< S >. Let F denote the 
set of all proper subgroups of G which contain H. Then, F 4 J, for H € F. Thus, 
F is a nonempty partially ordered set with respect to inclusion. Let {H, | a € J} 
be a chain in F. Since union of a chain of subgroups is a subgroup, U,<,; Ho is a 
subgroup of G. Since each H, is a proper subgroup of G containing H , x ¢ H, 
for all a € J. Hence, x ¢ U,-; Ha, and so U,-; Ha € F. This shows that every 
chain in F has an upper bound. By the Zorn’s Lemma, F has a maximal element 
L (say). Then, (i) x ¢ L and (ii) L is a maximal subgroup, for if K is a subgroup 
of G such that L is properly contained in K, then K ¢ F, andso x € K. But, then 
G =< S\U{x} >C K. This shows that x ¢ ®(G). Hence, ®(G) C X. tt 


Theorem 10.3.34 Frattini subgroup of a finite group is always nilpotent. 


Proof Let G bea finite group, and P a Sylow p-subgroup of the Frattini subgroup. It 
is sufficient to show that P is normal in ®(G). In fact, we show that P is normal in G. 
Let g € G. Since ®(G) 1G, gPg"! C &(G). Thus, P and gPg™! are both Sylow 
p-subgroups of ®(G), and so they are conjugate in ®(G). Hence, there exists u € 
®(G) such that gPg~' = uPu7!. But, thenu~'g € Ng(P). Thus, g € uNg(P) for 
someu € ®(G). This shows thatG = ®(G)Ng(P) = < ®(G) LU Ng(P) >. Since 
every element of ®(G) is a nongenerator, and since G is finite, G = < Ng(P) > 
= Noc(P). This proves that P IG. tt 


To summarize, we have the following: 


Theorem 10.3.35 Let G be a finite group. Then, the following conditions are equiv- 
alent. 


(i) G is nilpotent. 
(ii) G satisfies normalizer condition. 
(iii) Every subgroup of G is subnormal. 
(iv) There isn such that NG(H) = G for all subgroups H of G. 
(v) Every Sylow subgroup of G is normal. 
(vi) G is direct product of prime power ordered groups. 
(vii) Every maximal subgroup of G is normal. 
(viii) G' C ®(G). tt 


Finite nilpotent groups are direct product of prime power ordered groups. The 
problem of classification of finite nilpotent groups reduces to the classification of 
finite p-groups. This is far more a complex problem. Here we shall restrict ourselves 
to some elementary facts about finite p-groups. 

Groups of order p’. 
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It follows from the structure theorem of finite abelian groups, that there are three 
nonisomorphic abelian groups of order p*, and they are (i)Z ps (ii) Zp? ® Zp, and 
(iii) Z, B Zp © Zp. 

Let G be a nonabelian group of order p>. Then, Z(G) # {e}. Since G is non- 
abelian, Z(G) 4 G.| Z(G) | p’, for otherwise G/Z(G) would be cyclic, and this 
is impossible. Thus, Z(G) is a cyclic group of order p. Also, since G is nonabelian 
and G/Z(G) is abelian, G’ = Z(G) (check that ®(G) is also Z(G)). Since G/Z(G) 
is noncyclic abelian group of order p*, G/Z(G) is direct product of two cyclic groups 
of order p. Suppose that 


G/Z(G) =< aZ(G) > @ < bZ(G) > 


Clearly, a? € Z(G) , b? € Z(G), and also the commutator (a,b) € G’ = Z(G). 
Suppose that (a,b) = c. Further, (a’,b) = a‘ba“ib-! = ai(bab"!)' = 
a(c la)! = aia“ic! = c'. Similarly, (a, b') = c!. Hence, c = (a,b) gen- 
erates Z(G). We have the following three cases: 

(i) G =<a,b\|a? =e = bP = (a,b)? = (a,(a,b)) = (b, (a,b)) >. 

(ii) G =<a,b\|a” =e = b” = (ab) = (a,(a,b)) = (b, (a,b) >. 
and 

(iii) G =<a,b|a”™ =e = D?,a? = (a,b) >. 


In case (i) the group is of exponent p, and it cannot occur if p = 2. Suppose that 
p is an odd prime. Then, groups described in (i) and (iii) are nonisomorphic (prove 
it). We show that the group described in case (ii) is isomorphic to that described in 
case (iii). The map which takes b in (iii) to ab in (ii), and a to a gives an isomorphism. 
If p = 2, then the group in (ii) is the Quaternion group Qg, and the group in (iii) is 
the dihedral group Dg, and they are not isomorphic. Thus, there are two nonabelian 
groups of order p*, and three abelian groups of order p>. Overall there are five 
nonisomorphic groups of order p*. 

In what follows, we will use some results from linear algebra, and the reader may 
refer to Algebra 2 or any other book on linear algebra for the purpose. The reader 
may also skip the proof till they acquire basic knowledge of linear algebra. 

A set S of generators of G is said to be a minimal set of generators or an irreducible 
set of generators, if no proper subset of S generates G. Every finite group has a 
minimal set of generators (prove it). The additive group Q of rationals has no minimal 
set of generators (prove it). It is easy to show (do it as exercise) that if a group has 
a minimal set of generators, then every set of generators contains a minimal set 
of generators. Distinct minimal set of generators may contain distinct number of 
elements. For example, {1} and {5,7} both are minimal set of generators of Z. For 
finite p-groups, we have the following theorem. 


Theorem 10.3.36 (Burnside Basis Theorem) Let G be a finite p-group of order p". 
Suppose that G/®(G) contains p" elements. Then, any minimal set of generators of 
G contains r elements (r is called the rank of the group). 
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Proof Since G is nilpotent, G’ C ®(G), and so G/®(G) is abelian. Further, given 
any maximal subgroup H of G, it is normal, and so G/H is prime cyclic. Hence, 
a” € H forall maximal subgroup H of G. This shows that G/®(G) an abelian group 
all of whose elements are of order p. This means that G/®(G) is a vector space over 
Z,» containing p’ elements. Thus, Dimz,G/®(G) = r, and so every minimal set 
of generators of G/®(G) contains exactly r elements. Further, if S generates G, 
then v(S) = {a®(G) | a € S} generates G/®(G). Conversely, suppose that S is 
such that v(S) generates G/®(G). Then, G = vl(< v(S) >) =< S> @(G) = 
< SU ®(G) >. Since ®(G) is a set of nongenerators, < S > = G. The result 
follows. tt 


Let G be a finite p-group of order p” and suppose that G/®(G) contains p” 
elements. Consider the group Aut(G) of automorphisms of G. Since every auto- 
morphism of G takes ®(G) to itself, it induces an automorphism of the vector space 
G/®(G). Thus, we get ahomomorphism 7) from Aut(G) to the group GL(r, p) of 
all automorphisms of the vector space G/®(G). It is an elementary fact from linear 
algebra that 


| GLO, p) |= (p" — Dip" — p)(p" — p*) +++ (p" = p!). 


Thekern = {a € Aut(G) | n(a@) = Ig; aq}. Let us denote this normal subgroup 
of Aut(G) by H. Then, Aut(G)/H is isomorphic to a subgroup of GL(r, p). 

Let {x1,x2,...,x,} be an ordered irreducible set of generators for G. Then, 
{WU X1, U2X2,...,U,xX,} iS again an ordered irreducible set of generators of G for all 
Uj, U2,...,u, € ®(G). Ifa € A,thena(x;) = uy, x1, a(x.) = U2X2,..., A(X%,) = 
u,X, for some uj, U2,...,uU, € B(G). Let S be the set of all ordered set of gener- 
ators of forms 4x1, UjX2,...,U,X,, Where uj, U2,...u, € ®(G). Clearly, | $ | = 
(py — pen, 

The group H acts on S through an action * given by a * ((yj, y2,---, ¥-)) = 
(a(y1), a(y2),.--, a(y,)). Ifa € A fixes any element of S, then it fixes each member 
of a generator of G, and so it is identity (every local isotopy group is trivial). Hence, 
every orbit of the action contains exactly h elements, where h = | H |. Thus, h 
divides | S |= p~"”. Since Aut(G)/H is isomorphic toa subgroup of GL(r, p), | 
Aut(G)/G | divides | GL(r, p) | = (p" — 1)(p" — p)--- (p" — p™~'). Hence, the 
order of Aut (G) divides p"—"” (p" — 1)(p" — p)---(p" — p’"'). 

Now, we classify a very special type of p-groups known as extra special p-groups. 

A finite nonabelian p-group is called an extra-special p-group if Z(G) = G’, 
and it is a cyclic group of order p. Thus, every nonabelian group of order p? is an 
extra special p-group. 

A group G is said to be the central product of its normal subgroups G1, G2,..., 
G, if 


(i) G = G,G2---G,, 

(ii) Z(G) C G; for alli, 
Gii) G/Z(G) is direct product of G;/Z(G), G2/Z(G),..., G,/Z(G), and 
(iv) [G;, Gj] = {e} for alli Fj. 
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Theorem 10.3.37 Let G be a finite extra special group. Then, it is central product 
of nonabelian groups of order p°. In particular, G is of order p*"*' for some m. 


Proof Since G’ = Z(G), (x, ajd2) = (Xx, a1)(X, a2) ((d2, xX), a1) = (x, a))(X.a2). 
In particular, (x,a’) = (x,a)? = e forall x € G. Hence, a? € Z(G) = G’ for 
all a € G. Thus, G/Z(G) is an elementary abelian p-group, and so it is a vector 
space over the field Z,,. Let c be a generator of Z(G). Note that (xu, yv) = (x, y) 
for all u, v € Z(G). Thus, (x, y) depends only on cosets x Z(G) and yZ(G). This 
defines a map f from G/Z(G) x G/Z(G) to Z, by cf°4)2@) = (x, y). It 
is straight forward to verify that f is a skew-symmetric bilinear form on the vector 
space G/Z(G). Further, if f(xZ(G), yZ(G)) = o forall yZ(G) € G/Z(G), then 
(x,y) = e forall y € G. But, then, x € Z(G), andsoxZ(G) = Z(G) the zero of 
G/Z(G). This shows that f is a nondegenerate skew-symmetric bilinear form. It is 
a standard result (see Algebra 2) of linear algebra that the vector space G/Z(G) is 
of even dimension 2m for some m, and there is a basis 


{x1 Z(G), Z(G), ...,XmZ(G), 1 Z(G), Y2Z(G), ..- Yn Z(G)} 


of G/Z(G) such that 


Gi) f(x; Z(G), y,Z(G)) = | for alli, 
(ii) f(x; Z(G), yj Z(G)) = Ofori A j, and also 
Gili) f(a Z(G), xjZ(G)) = 0 = f(y Z(G), yj Z(G)) for all i, 7. 


Let G; be the subgroup generated by {x;, y;}. Then, G; is a nonabelian group of 
order Pp. Since Z(G) = ®(G) and G/Z(G) is generated by 


{x1 Z(G), Z(G), ...,xXmZ(G), v1 Z(G), Y2Z(G), «Yn Z(G)}, 


{X1, V1, X2, Y2,---,Xm; Ym} generate G. Hence 

(i) G= G\G2 eae Gn. 

Further, since f(x;Z(G), y;Z(G)) = 1, (x, y;) = c € G;. Thus, 

(ii) Z(G) C Gj. 

(iii) Clearly, G/Z(G) is direct product of G;/Z(G), G2/Z(G), ...Gm/Z(G). 
Again, fori A j f(x;Z(G),x;Z(G)) = 0 = f(x; Z(G), y;Z(G)). Hence 

(iii) [G;, G;] = {e} foralli F j. 

This shows that G is central product of G1, G2,..., Gn. tt 


Remark 10.3.38 It is easy to observe that the converse of the above proposition is 
also true. 


Exercises 
10.3.1 Let G be a group of order p*. Can Z(G) = G’ = ®(G)? Support. 


10.3.2 Find the radical of GL(n, R) and also of SL(n, R). 
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10.3.3 Show that every group of order 57 - 7° is nilpotent. Find the number of 
nonisomorphic groups of this order and list them. 


2 


re 


10.3.4 Find the number of nilpotent groups of order p7p3- ++ p 


10.3.5 Let G be a finite nilpotent group, and H a nontrivial normal subgroup of G. 
Show that H (| Z(G) # {e}. 


10.3.6 Show that a minimal normal subgroup of a nilpotent subgroup is contained 
in the center. 


10.3.7 Let G be a nilpotent group, and H a maximal normal abelian subgroup of 
G. Show that H = Cc(A). 


10.3.8 Show that T(n, R) is not nilpotent. 


8.7.9. Show that the group U (n, R) of upper triangular matrices all of whose diagonal 
entries are | is a nilpotent group. Find the index of nilpotency. 


10.3.10 Let G be a finite nilpotent group and p divides | G |. Show that p divides 
| Z(G) |. 


10.3.11 Show that a finite group G is nilpotent if and only if elements of co-prime 
orders commute. 


10.3.12 Can $3 be the Frattini subgroup of a finite group? Support. 


10.3.13 Show that G; © G2 implies that ®(G,) ~ ®(G2). Show, by means of an 
example, that the converse is not true. 


10.3.14 Show that a finite group G is nilpotent if and only if G/®(G) is direct 
product of cyclic groups of prime orders. 


10.3.15 Let G be a finite group. Show that G is nilpotent if and only if G/®(G) is 
nilpotent. 


10.3.16 Can G/®(G) © Qg? Support. 


10.3.17 Let G; and G2 be groups and G = G, x G2. Show that ®(G) is asubgroup 
of ®(G,) x ®(G2). Show that they are equal provided that G; and G2 both are finite 
(or at least finitely generated). In general, it is not known whether the equality always 
holds. This problem is related to the problem of existence of maximal subgroups of 
a simple group. 


10.3.18 Characterize finite nilpotent group G for which G/G’ is cyclic. 
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10.4 Free Groups and Presentations of Groups 


Consider a group G which is generated by an element x and it is subject to only one 
condition x” = e. Clearly, this is acyclic group of order n which is isomorphic to Z,,. 
If a group G is generated by an element x subject to the conditions x” = e = x”, 
then it is a cyclic group of order d, where d is the greatest common divisor of 
m and n. Let us consider a group G generated by {a, b} subject to the conditions 
a” = b” = e, bab-! = a’. Then,a = b"™ab-” = a™,andso,a”—' = e.In 
turn, a’ = e, whered is the g.c.d. ofr” — 1 andn. Itturns out thatG = {a'b/ | 0 < 
i<d—1,0< j <m-— 1} isa group of order dm. The multiplication in G is given 
by a’b/akb! = a"b’, where u is the remainder obtained when r/k + i is divided 
by d and v is the remainder obtained when j + / is divided by m. Description of a 
nonabelian group of order pq is an example of this kind. 

One of the central problems in group theory with tremendous applications to 
topology, geometry, and other branches of mathematics and physics is to realize the 
group, once a set of generators together with relations is given. For example, to every 
knot there is a group attached, called the knot group, which is invariant of the knot, 
and whose generators and relations can be obtained. It is, therefore, one of the main 
problems in knot theory, and in turn, in topology, to have an effective procedure by 
which one can distinguish groups given by generators and relations. Poincare, Dehn, 
and Tietze realized the importance and the gravity of the problem, and they made 
some initial and original contributions in this direction. 

In this section, we introduce formal theory of presentations of groups. 

We have the following universal problem: 

Let X be a set. Does there exist a pair (G,i), where G is a group and i a map 
from X to G such that given any pair (H, j), where H is a group and j a map from 
X to H, there is a unique homomorphism 77 from G to H which makes the following 
diagram commutative? 

i 


\ / 
H 


We show that the solution to the above problem exists, and it is unique up to 
isomorphism. 
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Uniqueness. Let (G, i) and (H, j) be solutions to the above problem. Then, there 
exist a homomorphism 7) from G to H and a homomorphism p from H to G such 
that the diagram 

i 
xX CG 


j 


p 


is commutative. Thus, pon and Jg both will make the left upper half triangle com- 
mutative. From the universal property of (G, i), it follows that pon = Ig. Similarly, 
nop = Ty. Thus, 7) is an isomorphism. 

Existence. Recall that if A and B are sets, then A? denotes the set of all maps 
from B to A. For each n € N, let 7 denote the set {1,2,...,}. Consider the set 
X(X) = Unen(X x {1, -1})” U{}. Elements of £(X) are called the words in X. 
% is called the empty word. A map f from7 to X x {1, —1} is an element of U(X). 
This element f is formally and conveniently denoted by 


1 | yen 
x1 Xy Xn > 


where f(i) = (x, 6) for all i © 7. We also denote the empty word by 1. We 
define product in £(X) by juxtaposition. Thus, if W) = x]''x5?--- x?" and W. = 


: yf? ... yl, then the product WW) is defined by 


By 


_— yar a2 Oy By Bs 
Wi W2 = xy Xy ae, yi ya reey 


Ss 


Given a word W = xj''x5?---x@* , W7! denotes the word x70 x71 ++ +x] 

The empty word | acts as identity with respect to the multiplication. We also identify 
1~! with 1. It is clear that ©(X) is a semigroup with identity. Define a relation R 
on &(X) as follows: (W,, W2) € R if and only if W> can be obtained from W,; by 
insertion of words of the types x°x~° between any two consecutive letters of the 
word W,, or deletion of such words whenever it appears in W; as a segment. Clearly, 
R is an equivalence relation on &(X). Let F(X) denote the quotient set, and [W] 
denote the equivalence class determined by the word W. It is evident that if W, is 
related to W; and W3 is related to W5, then W W2 is related to W; W;. This shows 
that we have a binary operation on F(X) defined by [W,][W2] = [W, W2]. Itis also 


clear that F(X) is a group with respect to this operation. We have a map i from X to 
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F(X) defined by i(x) = [x!]. We show that the pair (F(X), 7) is a solution to the 
universal problem. Let j be amap from X toa group H. Then, we have a map ¢ from 
5 (X) to H given by h(x x9? «+ x8) = (irr) Cj (x2))2 + + Gi Oe) which is 
also ahomomorphism. It is evident that if W, is related to W2, then d(W,) = @(W2). 
Thus, ¢ induces a homomorphism 77 from F(X) to H. Clearly, 7 makes the required 
diagram commutative. It is also clear that the only homomorphism which can make 
the required diagram commutative is 7). 

The pair (F(X), i) thus obtained is called the free group on X. It is clear that 7 
is an injective map, and we identify X as a set of generators of F(X). 

Free groups satisfy the following lifting property, also called the projective prop- 
erty, in the category of groups. 


Theorem 10.4.1 Let 3 be a surjective homomorphism from G, to G2. Let F(X) 
be the free group on X, and f a homomorphism from F(X) to G2. Then, there 
exists a homomorphism n (not necessarily unique), called a lifting of f, such that 
the following diagram is commutative. 


G, 


f 
F(X) UG 


Proof Consider the following diagram, where the maps i, @ and f are given. 
J 
xX CG, 


f 
F(X) GG 


Since (3 is surjective, 3-'(f (i(x))) A @ for all x € X. By the axiom of choice, we 
get a map j from X to G,; such that Goj = foi. Thus, the square and the triangles 
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X F(X) G2 and X G; G2 are commutative. From the universal property of the free 
group, there exists a unique homomorphism 77 from F(X) to G; such that the triangle 
X F(X) G, is commutative. Clearly, (Gon)oi = Go(noi) = Boj = foi. Thus, 
Gon and f both make the triangle X F(X) G2 commutative. From the universal 
property of the free group, it follows that Gon = f, and so the triangle F(X) G; G2 
is commutative. t 


Theorem 10.4.2 Every group is quotient of a free group. 


Proof Let G be group. Let F(G) denote the free group on the set part of the group 
G. We have the identity map /g from the set G to the group G. From the universal 
property of free group, there exists a unique homomorphism 7 from F(G) to G 
such that joi = Ig. Thus, 77 is surjective. The result follows from the fundamental 
theorem of homomorphism. ft 


Given any set )t(X) of words in X, it defines a unique group F(X) /H, where H is 
the normal subgroup of F(X) generated by {[W] | W € R(X)}. If a group G is iso- 
morphic to the group F(X)/#H described above, then we say that < X ; i(X) > is 
a presentation of G. In particular, < X ; % > is presentation of the free group 
F(X), also called the free presentation. The elements of i(X) are called the 


defining relators of the presentation. Incase X = {x,,x2,...,x,}1s a finite set and 
N(X) = {R,, Ro, ...R,} is also finite, we denote the presentation < X ; R(X) > 
by < x1, X2,...,%- 3 Ry, Ro,...R,; >. Such a presentation is called a finite presen- 


tation. In general, a group need not have a finite presentation (the additive group 
Q of rational numbers does not have any finite presentation, for it is not finitely 
generated. In fact, it cannot have finitely many defining relators also). Clearly, a 
group has so many presentations. For example, < x; x7 > and < x; x+,x° >, both 
are presentations of a cyclic group of order 2. 

Theorem 10.4.2 says that every group has a presentation. In general, it is very 
difficult to recognize a group by its presentation. We give some simple examples 
of presentations. Presentations for nonabelian groups of order pg and nonabelian 
groups of order p? have already been discussed in earlier sections. 


Theorem 10.4.3 (von Dyck’s Theorem) Let < X ; 3t(X) > be a presentation of a 
group G and < X ; V(X) > be a presentation of a group G' such that R(X) © 
Nt’ (X). Then, G’ is a homomorphic image of G. 


Proof Since X(X) C i’ (X), the normal subgroup H generated by t(X) is contained 
in the normal subgroup H’ generated by W(X). By the first isomorphism theorem 
F(X)/H' & (F(X)/H)/(H'/H). Thus, G’ is isomorphic to a quotient group of G. £ 


Example 10.4.4 Let us describe and recognize the group given by the presentation 
< x,y; x3, y?, xyx7*y >. We look for a group which is generated by two elements, 
one of order 3 and the other of order 2. The simplest such group is $3 which is 
generated by the cycle o = (123) and the transposition = (12). It is also evident 
that ot o~7 is the identity permutation. 
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Let F(x, y) denote the free group on the set {x, y}. From the universal property 
of the free group, we have a unique surjective homomorphism 77 from F(x, y) to $3 
with n(x) = a and7n(y) = 7. Let A be the normal subgroup of F(x, y) generated 
by the set {x°, y?, xyx~?y} of defining relations. Since a3, 7? and ot o~*7 are 
identity permutations, it follows that H C ker 7. From the fundamental theorem of 
homomorphism, 77 induces a surjective homomorphism 7 from F(x, y)/H to $3. 

Now, we try to enumerate the cosets of F(x, y) modulo H and find a bound 
for the order of F(x, y)/H, if it happens to be finite. H is the identity coset. Con- 
sider the set S = {H, [x]H, [x7]H, LIA, [LyJixJH = [yx]A, [yx?]H} of cosets 
of F(x, y) mod H. We show that [x]S = S = [y]S. Clearly, [x]H, [x]x]H = 
L7H, [x]b7]H Del] = A belong to S. Again, [x][y]H = [xy]H = [yx?]H, 
for [xyx-*y] € H. Further, [x]l[yx]JH = [xyx]H = [y]H, for [xyxy7'] = 
[xyxy] = xyyx? € H. Also, [x]lyx7]H = [xy]Hb2]H = [yx] L714 [x4] 
HT [yx]H. This shows that [x]$ = S. Similarly, [y]S = S. Since {[x], [y]} generate 
F(x, y), it follows that S is closed under product of any element of F(x, y) from 
left. Hence, S is the set of all cosets of F(x, y) modulo H. This shows that the group 
F(x, y)/H contains at the most six elements. Since 77 is surjective homomorphism 
from F(x, y)/H to $3, it is bijective. Thus, the given presentation describes $3. 


More generally, the following example gives a presentation of S,, for each n. 


Example 10.4.5 The symmetric group S,,n > 2 has a presentation 


SM Ryne Xand XP, ty), Gee) | letan= tle jen—2,| 
k-1|>2,1<k<n-1,1<Il<n-1>. 


Proof Let X denote the set {x1, X2,...,%,-1}. Define a map a from X to S, by 
a(x;) = (i,i +1). From the universal property of a free group, we have a unique 
homomorphism 7) from F(X) to S,, which takes x; to (i, i + 1). Since S, is generated 
by the set {(@,i+ 1) | 1<i<n-— 1}, 7 is a surjective homomorphism. Further, 
n(x?) — (n(xj))? = (,i+1)? = 1. Similarly, it is easy to observe that 7) takes 
all the defining relators of the given presentation to identity. This means that the 
normal subgroup H of F(X) generated by the set of defining relators is contained in 
the kernel of 7). Thus, 7) induces a surjective homomorphism (fundamental theorem 
of homomorphism) from F(X)/H to S,. 

Thus, to show that 7 is an isomorphism, it is sufficient to show that F(X)/H 
contains at most n! elements. This we prove by induction onn. Ifn = 2, F(X)/H 
is cyclic group of order 2. Assume the result for n — 1,n > 3. Let K be the sub- 
group of F(X)/H generated by {[x1], [x2], ..., [%,-2]}. Clearly, there is a surjective 
homomorphism from the group with presentation 


Min Rie cna a ape ay |l<i<n-2,1<j<n-3,| 
k=1|=2 12k an=2,12len=2> 


to K. Thus, by the induction hypothesis | K |< (mn — 1)!. We show that there can be at 
most n cosets of F(X)/H modulo K. More precisely, we show that every element of 
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F (X)/H lies in at least one of the left cosets [x1x2 +++ Xn—1]K, [%2%3-+++Xn-1]K,--- 
[x,-1]K, K. It is sufficient to show that product of an element from left to an ele- 
ment of some of these cosets is again an element of some of these cosets. Since 
[x1], [x2], ..., [xn] generate F(X)/H, it is sufficient to show that for each i, prod- 
uct from left of [x;] to the cosets above is again one of those cosets. Suppose that 
i <j —2 <n -—23. Then, since (x;x;)* is a relator, [xixj]? = [x J|xlbillxj] = e 
in F(X)/H. Since Ee and xe are also relators , it follows that [x;]> = [xj 1? =e: 
Hence, in this case [x;][x;] = [x,][x;], and so 
[xi] - [x jx joi + + Xn] K 
= [xilbejlbejqi) ++ ni) K 

Xi|Pejsid ++ n-dbi)K 
Xj |Pejsid ++ tn K 
XjX joie Mn K, 
for [x;] €¢ K. Next, suppose thati = j — 1. Then, 


[xix |ejzi)- ++ na K = Lixin. Xi42 +++ X11 K. 
Ifi = j, then also 
[xi |beixiga ++ Xn) K = [xi41Xi42°+- Xn) K, 


for x is a relator. Suppose that i > j + 1. Then 

[x ][4 jx j41 +++ Xn) K 

= [x jXj44 te XX{ NXg + Xn] K 

= [xsl bejsi) + beixi ii ei xi42 ++ Xn) K. 

Since (x;_1x;)°, a and - are relators, it follows that [x;x;-1x;] = [xj;—1%;x;-1], 
and further, since [x ;_;] commutes with all [x;], k > i, we find that 

[x ][4 Xx j41 +++ Xn-1 1K 

= [XjXjr e+ Xp G1) K 

= [xjXj41-++X,-1]K. The proof is complete. tt 


The method used in the above two examples to determine an upper bound of the 
order of a group given by a presentation is called the coset enumeration method. 
Many variants of this method are used to recognize finite groups given by presenta- 
tions. One can develop a program and take help of computer to describe finite groups 
given by presentations. For details, see “Handbook on Computational Group Theory’ 
by D.F. Holt, B. Eick, and O Brien. 

Motivated from some classification problems in Topology, Dehn, in 1911, posed 
the following three problems known as Dehn’s fundamental problems. 

1. Word problem. Given a presentation < X ; )t(X) >, to find an algorithm 
which decides, in finite number of steps, whether a word W defines the identity 
element in F(X)/H. 

2. Conjugacy problem or Transformation problem. Given a presentation < 
X ; R(X) >, to find an algorithm which decides, in finite number of steps, whether 
a pair of words W; and W2 determine conjugate elements in F(X)/H. 
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3. Isomorphism problem. Given two presentations < X ; }t(X) > and < 
Y¥% R (Y ) >, to find an algorithm which decides, in finite number of steps, if they 
define isomorphic groups. 

Clearly, the solution to the second problem gives a solution to the first problem, 
for a word W defines the identity element if and only if it defines the element which 
conjugate to the element defined by the empty word. The third problem is the most 
difficult. 

In general, for a given presentation, there may not exist any algorithm for the word 
problem (conjugacy problem, isomorphism problem). In fact, Novikov and Boone 
in 1955, and later many more people gave examples of finite presentations, where 
such an algorithm, even for word problem, does not exist. Here, we shall describe 
these algorithms for some simple presentations. 


Theorem 10.4.6 Word and conjugacy problems for a free presentation are solvable. 


Proof Let < X ; > bea free presentation. Let W be a word in the defining symbols. 
We define free reduction p(W) of a word W by the induction on the length of the 
word W.If/(W) = 0, then W is the empty word # and we define p(4) = 0. Assume 
that p(W) has been defined for all words of length at most n with further property 


that ie(W)) = < ik alana a word Ways where @,41; = +1. Suppose that 


p(W) = yi'ys?..- 0. Then define p(Wxp') = p(W)x,1') provided that y, 4 
Xn41 Or Bs A —An41. Uf Xp41 = ys andalso 8, = —ay+41, then define p(Wxiit') = 
p(e(W) xp). Clearly, 1(o(Wx,'1'')) < (Wx,"\'). The definition of the map p is 
complete. The following properties of p can be proved easily by the induction on the 


length of the word. 


(i) pY) = 

(ii) [W] = [p(W)] in the free group for all words W. 

(iii) p(W) is the word of smallest length in the class [W]. 

(iv) p(p(W)) = p(W). 

(v) [Wi] = [Wo] if and only if p(Wi) = p(W,). 

(vi) p(WiW2) = p(e(W1)W2) = p(Wip(W2)) = p(p(Wi)p(W2)). 


It is clear that W defines identity in the free presentation if and only p(W) = 
%. This gives the solution of word problem for free presentation. p is called the 
free reduction. 

For the conjugacy problem, we define o(W) for each word W by induction on 
l(p(W)). If p(W) = @, then we define o(W) = @. Assume that o(W) has already 
been felines for all those words for which /(p(W)) <n. Suppose that p(W) = 


ye" yh ee eee Then, o(W) = p(W) provided that y; A y,41 oF a FX —Bn4t. 


If yy = yp, and 6; = —(,41, then define o(W) = yee + Bs . It can be 


checked ae the induction that words W; and W) define ecnpaeaie < oe in the 
free group if and only if co(W;) = o(W2). This solves the conjugacy problem for 
free presentation. The map a is called the cyclic reduction. tt 


Corollary 10.4.7 Every free group is torsion-free in the sense that every nonidentity 
element is of infinite order. 
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Proof Let [W] = [p(W)] be a nonidentity element of the free group. Then, p(W) 
is not the empty word. If a(o(W)) = p(W), then it is clear that p((p(W))”") = 
(p(W))” which is not the empty word. Further, since [W] is not identity, it is not con- 
jugate to identity, and hence, [7(p(W))] which is conjugate to [o(W)] cannot define 
identity element. Clearly, [p(W)]” is conjugate to [7(p(W))]” which is nonidentity. # 


Now, we discuss isomorphism problem for presentations. Let < X; {i > be a pre- 
sentation. We say that a word W in X is derivable from ‘if itis reducible to the empty 
word after finite number of steps under the operation of deletions and insertions of 
members of ‘it U KR! and also of trivial relators. The following four transformations 
on the presentation < X; it > are called the Tietze transformations. 

T,. If a word W in X is derivable from ‘, then put W in the set {i of defining 
relators. 

T». If a defining relator P € ‘i is derivable from the rest of the relators in %, then 
delete it from the set St of defining relators. 

T;. Let W be a word in X. Insert a symbol u in the set X of generating set and 
also insert u~!'W or Wu~' in the set % of defining relators. 

T4. If there is a relator R in t of the form x~!W or Wx—!, where W is a word not 
involving the symbol x, then delete x from the generating set X, the relator R from 
the set 3¢ of defining relations and also substitute W at the place of x in all the rest 
of the relators in fi — {R}. 


Theorem 10.4.8 Let < X;3t > and < Y; 8 > be two presentations. They define 
isomorphic groups if and only if one can be obtained from the other by using Tietze 
transformations. 


Proof Suppose that < Y; t' > is obtained from < X; St > by using T; and T>. Then, 
X = Y and the normal subgroups of F(X) generated by St and 9’ are same, and 
therefore, they define isomorphic groups. If < Y; i > is obtained from < X; Kt > 
by 73, then X C Y,andthemap[W] < i >~» [W] < X > defines an isomorphism. 
Since Ty is the inverse transformation of 73, it also does not change the group. 
Conversely, suppose that the given two presentations define isomorphic groups. 
Let : be the isomorphism from F(X)/ <> to F(Y)/ <® >. Suppose that 
o([x] <HR>) = [Wy] < Xt >, where W, is a word in Y, and ¢7![y] < R>= 
|W. ae >, where W, is a word in X. Then, the presentation < X DY; RU RU 
{x7 TW, |x € X} Ut w, | y € Y} > defines a group which is isomorphic to both 
the groups. tt 


In general, there is no algorithm which decides in finite number of steps whether 
given two presentations define isomorphic groups. The philosophy of this part of 
group theory is to search some effective computable algebraic invariant of the group 
in terms of presentation. More precisely, given a class & of presentations, we try 
to have association f which associates to each presentation P € & a computable 
invariant f(P). The discussion of these invariants is beyond the scope of this book. 

We illustrate the use of Tietze transformations to solve isomorphism problem in 
some very simple cases by means of examples. 
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Example 10.4.9 The presentations 
< a,b,c; b*, (bc)? > 


and 
ee ee) 
<X,Y,%Y 2 > 


define isomorphic groups. 


Proof We show that the second presentation can be obtained from the first by 
means of Tietze transformations. Use 73 to adjoin generators x, y,z and rela- 
tors x-!a, y-'b, z~'be in the first presentation. Thus, we arrive at the presenta- 
tion < a,b,c, x,y,z; x7'a, y~'b, z7'be, b?, (bc)? >. Now, use Ty to remove the 
generators a,b,c from this presentation, and thus, we arrive at the presentation 
< x,y,z; y’, 2” >. It also follows that the group presented contains an element [x] 
of infinite order. tt 


Example 10.4.10 The presentations 
< a,b; a‘, b*; a*b’a'b"! > 


and 


1,.-3 


LIE Y aay eS 
define isomorphic groups and it is of order 20. 


Proof Adjoin the generators x, y with relators x~!ab, y~'b in the first presentation 
and observe that (ab)> is derivable in the first presentation. The rest follows easily. # 


We conclude the section by stating the following theorem of Nelson and Schreier 
without proof. For further study in the theory of presentation of groups, the reader 
is referred to combinatorial group theory by Magnus, Karras, and Solitor. 


Theorem 10.4.11 Every subgroup of a free group is free. 
Exercises 


10.4.1 Show that the presentation < x, y;x°, y*, (xy)? > describes the symmet- 
ric group of degree 4, and then show that the word problem is solvable for this 
presentation. 


10.4.2 Show that the word and conjugacy problem are solvable for the presentation 
My No deg Bet x" ; ae ...,X," >. Determine the number of conjugacy classes 
of elements of finite orders. 


10.4.3 Let d be the g.c.d of m and n? — 1. Show that the presentations 


. ym —1 —n 
<x,y;x",y?,y xyx" > 
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and 
<a,b.a‘,b’,b aba" > 


define isomorphic groups. 


10.4.4 Let X = {a, |n¢N}andR = {a,(ay4,)~“*}. Solve the word problem 
for the presentation < X; R >. Show that it is a presentation of the additive group 
of rational numbers. 


10.4.5 Find the order of the group having presentation 
: -1 -1 -1 -1 
< 4), 42,...,4n;4|d2d3, ,d2d3d4 ,...,An—24n-14, ,Qn-14na, > 
forn < 6. Is it finite for n > 7? 


10.4.6 Solve the word problem for the presentation < a,b; a", b™?a"b"™4 >. 


10.4.7 Identify the groups with following presentations. 


(i) <a,b; a’, b*,a*b? >. 
(ii) < a,b; a’, b’, (ab) >. 


10.4.8 Find, if possible, the orders of the groups with following presentations. 


(i) <a,b;a*, b>, aba*b >. 
(ii) < a,b,c; a‘*, b*,c?, a2c?, abc? >. 
(ili) <a, b,c; a’, b>, ©, abe >. 


10.4.9 Determine, if possible, the order of the commutator subgroups of the groups 
presented in Exercises 10.4.7 and 10.4.8. 


10.4.10 Let < X,; 3; > and < Xz; Ny > be presentations of a group G. Let < 
X}; Ni >and < X5; NL > be presentations of a group G’. Assume that X; () X} = 
6 = X2{)X4 (note that there is no loss of generality in this assumption). Show 
that the presentations < X; ) X}; #, UM, > and < X2U X45; Ro UN, > define 
isomorphic groups. The group, thus obtained, is called the free product of G and 
G’. The free product of G and G’ is denoted by G « G’. 

Hint. Use Theorem 10.4.8. 


10.4.11 Show that there is an injective homomorphism i from G to G * G’, and 
there is an injective homomorphism j from G’ to G « G’ such that if there is a group 
H together with a homomorphism jz from G to H, and a homomorphism v from G’ 
to H, then there is a unique homomorphism ¢ from G * G’ to H such that doi = yu 
and goj = V. 


Chapter 11 
Arithmetic in Rings 


This chapter is devoted to the study of rings in relation to their arithmetical properties. 


11.1 Division in Rings 


In this section, we introduce some arithmetical concepts in rings. Let R be a commu- 
tative integral domain with identity, and R* denotes the set of nonzero elements of 
R. The multiplication in R is denoted by juxtaposition. Thus, the product of a and b 
is denoted by ab. Anelement a € R* is said to divide an element b € R*, if there is 
an element c € R* such that b = ac. The notation a/b is used to say that a divides 
b. If a divides b, then we say that a is a divisor or a factor of b. We also say that b 
is a multiple of a. 

Let u be aunit of R anda € R*.Thena = uu~'a. Since u—! € R*, u/a. Thus, 


Proposition 11.1.1. Every unit element of R divides every nonzero element of R. { 


Proposition 11.1.2. Leta, b € R*. The following two conditions are equivalent. 
(i) a/b and b/a. 
(ii) a and b differ by a unit in the sense that there is a unit u such thatb = au. 


Proof (i) = > (ii). Assume (i). Then b = ac anda = bd for somec,d € R*. 
But, then bl = b = bdcandal = a = acd. Sincea,b € R*, and R is an 
integral domain, we getdc = 1 = cd. Thus, a and b differ by a unit. 

(ii) => (i). Suppose that b = au, where u is a unit. Then clearly, a/b and also 
sincea = bu, b/a. t 


We say that a,b € R* are associates to each other if they differ by a unit, or 
equivalently, a/b and b/a. The notation a ~ b is used to say that a is an associate 
of b. It is easy to see that the relation ~ of ‘being associate to’ is an equivalence 
© Springer Nature Singapore Pte Ltd. 2017 387 
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relation. Further, a ~ b if and only if whenever a/c, b also divides c. Thus, while 
studying the arithmetic properties of rings, we need not distinguish associates. 

Let a € R*. Then units of R and associates of a are always divisors of a. They 
are called improper divisors. Other divisors are called proper divisors of a. 

Anonunit element p € R* is called anirreducible element of R if it has no proper 
divisors. It is said to be a prime element of R if p/ab implies that p/a or p/b. 

An element d € R* is called a greatest common divisor (g.c.d) or a greatest 
common factor (g.c.f) of a, b € R*, if 

(i) d/a and d/b, and 

(ii) d'/a, d'/b implies that d'/d. 

If d; and d, are greatest common divisors of a,b € R*, then d;/dz and d2/d\. 
Thus, any two greatest common divisors of a, b € R* are associates to each other. 
Therefore, ifa, b € R* has a greatest common divisor, then it is unique upto associate. 
A g.c.d of a and b will be denoted by (a, b). 

An element m € R* is called a least common multiple (1.c.m) of a and D, if 

(i) a/m and b/m, and 

(ii) a/m’, b/m' implies that m/m’. 

As in case of g.c.d, l.c.m, if exists, is unique up to associates. A l.c.m of a and b 
will be denoted by [a, b]. 

The concepts defined above are well understood in the ring of integers right from 
our school days. The group U(Z) of units of the ring of integers is the two element 
group {1, —1}. Here, irreducible and prime elements are same. Further, g.c.d and 
l.c.m exist, and there is an algorithm to find g.c.d and |.c.m of any two nonzero 
integers. Up to being associates, product of any two nonzero integers is same as the 
product of their g.c.d and I.c.m (see Chap. 3). 


Example 11.1.3 The subset Z[i] = {a+ bi | a,b € Z} of complex numbers 
under the usual addition and multiplication of complex numbers is a commutative 
integral domain with identity. The members of Z[i] are called Gaussian integers. 
The Gaussian integer 2 + 3i divides —1 + 57, for—1+57 = (2+ 31)(1 +7). The 
Gaussian integer 2 + 37 does not divide —1+4i, forif—-1+4i = (24+3i)(a+ bi), 
then taking the square of the moduli of each side, we get that 17 = 13(a? +b’). 
This is impossible, for (a* + b*) € N. 

Let a + bi be a unit of Z[i]. Then, there is a (c + di) € Z[i] such that (a + 
bi)(c+ di) = 1. Again, taking the squares of the moduli of both sides, we get that 
(a? +b*)(c? +d?) = 1.Since a? +b? and c* +d? are natural numbers, a7+b* = 1. 
This implies thata = tlandb = Oora = Oandb = +1. Thus, the group 
U(Z{i]) of units of Z[i] is the cyclic group {1, i, —1, —i} of order 4. As such, the 
associates of a + bi area + bi, —b + ai, —(a+ bi), and b — ai. 

2 = 2+0i is not irreducible in Z[i], for2 = (1+i)(1 —i), where neither | +i 
nor | — i is a unit in Z[i]. Observe that 2 is irreducible in the subring Z of Z[i]. 

The Gaussian integer | + 7 is irreducible in Z[i], for suppose that (1 +7) = 
(a + bi)(c + di). Then, taking the squares of moduli of both sides, we get that 
2 = (a 4+b)(c? +d’). Since a? + b* and c* + d? are natural numbers, 
ath = lor’+d = 1.lfa?+b? = 1,thena = tlandb = Oora = 0 
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and b = +1. This means that a + bi is unit. Similarly, in the other case, c+ di is a 
unit. 

We shall show later in this chapter that irreducible elements and prime elements in 
Z{i] are same (as in the case of Z). We will also show that any two nonzero elements 
of Z[i] have g.c.d as well as l.c.m and as in case of Z, we have also algorithms to 
find them. 


Example 11.1.4 (Arithmetic in Z[./—5]). Consider the ring Z[/—5] = {a+ 
b/—5 | a, b € Z}. Leta+b./—5 be aunit in Z[./—5]. Then, there exists c+d./—5 € 
Z[/—5] such that (a + b\/—5)(c +dV/—5) = 1. Equating the squares of moduli 
of each side, we get that (a? + 5b?)(c? + 5d*) = 1. Since a? + 5b? and c? + 5d? 
are natural numbers, a” + 5b* = 1. But, thena = +1 andb = 0. This shows 
that the group U(Z[./—5]) of units of Z[./—5] is {1, —1}. Thus, the associates of 
a+b/—5 in Z[./—5] are +(a + b/—5). 

The element 2 = 2+ 0-./—5 is irreducible in Z[/—5]. For, suppose that 
2 = (a+b V/—5)(c + dV—5). Then, again equating the squares of moduli of each 
sides, we get that4 = (a? +5b?)(c? + 5d”). Since a? + 5b? and c* + 5d? are natural 
numbers, a2+5b? = 1or2or4. Clearly, a*+5b? cannot be 2. Ifa?+5b? = 1, then 
a = +landb = Oand inthis case,a+b/—5 = +1 isaunit. Ifa2+5b? = 4, 
then c? + 5d? = 1, and so in this case, c + d./—5 is a unit. This shows that 2 is 
irreducible. Using the same argument as above, it follows that 1 + J/—5, 1—J—5, 
and their associates are irreducible. Clearly, 2 does not divide | + /—5, and it does 
not divide 1 — ./—5. But 2 divides 6 = (1 + /—5)(1 — /—5). This shows that 
2 is not a prime element of Z[./—5]. It also follows that 1 + /—5, 1 — ./—5 and 
their associates are also not primes. Thus, contrary to the case in the domain Z of 
integers, an irreducible element in the integral domain Z[/—5] need not be a prime 
element. 

Again, a pair of nonzero elements in Z[,/—5] need not have a g.c.d. For example, 
considera = 6andb = 2- (1+ /—5). We show that a and b have no greatest 
common divisors. Suppose contrary. Let d be a greatest common divisor of a and b. 
Since 2/a and 2/b, 2/d.Letd = 2-(m+n/—5). Since d/b, (m+nJ/—5)/(1+ 
./—5). Further, since 1 + ./—5 is irreducible, m + n./—5 is a unit or an associate of 
1+ /—5. This shows that d ~ 2 or d ~ b. Since 1 + ./—5 is a common divisor of 
a and b and 1 + /—5 does not divide 2, d can not be an associate of 2. Again, since 
b does not divide a, d can not be an associate of b. Thus, such ad does not exist. 

Finally, a pair of elements in Z[,/—5] may have a g.c.d but no l.c.m. For example, 
consider the elements 2 and 1 + /—5 of Z[/—5]. Since 2 and 1 + /—5 are both 
irreducible, and they are nonassociates, units and units are the only common divisors. 
Thus, | is ag.c.d of 2 and 1-+./—5. We show that they do not have any I.c.m. Suppose 
contrary. Let m be al.c.m of 2 and 1 + /—5. Then, m = 2(a + b./—5) for some 
a,b € Z. Since 2 divides 2(1 + /—5) and 1 + /—5 also divides 2(1 + /—5), it 
follows that 2(a+b./—5) divides 2(1+/—5). But, then a+b./—5 divides 1+./—5. 
Since 1 + ./—5 is irreducible, a + b./—5 is a unit or an associate of 1 + ./—5. Hence, 
m ~ 2orm ~ 2(1 + /—5). Since the common divisor 1 + ./—5 does not divide 2, 
m can not be an associate of 2. Also since 2(1 + ./—5) does not divide, the common 
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multiple 6 of 2 and 1 + /—5, it can not be an associate of 2(1 + /—5). This proves 
the impossibility of the existence of l.c.m of 2 and 1 + /—5S. 


Remark 11.1.5 It becomes evident from the above example that the arithmetic of 
Z{/ —5] is significantly different from that of Z and Z[i]. In general, an irreducible 
element need not be a prime element. Also g.c.d (or l.c.m) need not exist in general. 


Proposition 11.1.6 Every prime element is irreducible. 


Proof Let p be a prime element, and suppose that p = ab. Then, p/(ab). Since p 
is assumed to be a prime element, p/a or p/b. Suppose that p/a. Then,a = pc 
for some c € R*. Hence, p = pcb. By the restricted cancelation law in an integral 
domain, cb = 1. Since R is commutative, bc = 1 and bis a unit. Thus, p ~ a. 
Similarly, if p/b, then a is a unit and p ~ b. This shows that p has no proper 
divisors. tt 


Proposition 11.1.7 Let R be acommutative integral domain with identity. Leta, b € 
R*. Let d be a greatest common divisor and m a least common multiple of a and b. 
Then ab ~ md. 


Proof Since a/(a- b) and b/(a- b), m/(a- b). Suppose thata-b = m-c. Since 
a/mandb/m, c/a and also c/b. But, then c/d. Hence,a-b = m-c divides m- d. 
Now, we show that m - d divides a- b. Since d isag.c.dofaandb, a = d-uand 
b = d-vforsomeu,v € R*. But, thena-b = d-d-u-v.Clearly,a/(d-u-v) 
and b/(d-u-v). Hence, m/(d-u-v). This shows that m - d divides a - b. tt 


Proposition 11.1.8 Let R be a commutative integral domain with identity and 
a,b,c € R*. Then 

(i) (c-a,c-b)~c- (a,b), 

and 

(ii) ((a, b), c) ~ (a, (b, €)) 

provided that both sides exist. 


Proof (i) Since c/(ca) and c/(cb), c/(ca, cb). Suppose that (ca, cb) = cd. Then, 
(cd)/(ca) and (cd)/(cb). Hence, d/a and d/b. Thus, cd divides c(a, b), and so 
(ca, cb) divides c(a, b). Also, since (a, b)/a and (a, b)/b, c(a, b) divides ca, and 
c(a, b) divides cb. This shows that c(a, b) divides (ca, cb). Thus, c(a, b) ~ (ca, cb). 

The proof of (ii) is left as an exercise. tt 


Corollary 11.1.9 Let R be a commutative integral domain with identity. Suppose 
that g.c.d of any two nonzero elements of R exist in R. Then, every irreducible element 
of R is a prime element. 


Proof Suppose that g.c.d of any two nonzero elements of R exist. Let p be an 
irreducible element of R. Suppose that p does not divide a, and also p does not 
divide b. Since only divisors of p are units and associates of p, it follows that 
(p,a) ~ 1 ~ (p, b). Now, using the above proposition, we have 
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(p, ab) ~ ((p, pa), ab) ~ (p, (pa, pb)) ~ (p, a(p, b)) ~ (p, a) ~ 1. 
Thus, p does not divide ab. This shows that p is a prime element. tt 


Arithmetic and Ideals. 


Let R be a commutative integral domain with identity. Let a € R. Then, the ideal of 
R generated by a is 


Ra = {xa|x € R}. 


An ideal generated by a single element is called a principal ideal. Clearly, a is a 
unit if and only if Ra = R. 


Proposition 11.1.10 Let R be a commutative integral domain with identity. Then 
the following hold: 

(i) a/b ifand only if Rb C Ra. 

(ii) a ~ b if and only if Ra = Rb. 

(iii) a is a proper divisor of b if and only if Rb is properly contained in Ra and 
Ra ZR. 

(iv) m is Lc.m of a and b if and only if Rm = Ra{)\ Rb. 

(v) Anelementd € Risag.c.dofaandb ifand only if Rd is the smallest principal 
ideal containing the ideal Ra + Rb generated by a and b. 


Proof (i) Suppose that a/b. Then, b = ca for some c € R*. But, then Rb = 
Rca C Ra. Conversely, if Rb C Ra, then since b € Rb, b € Ra. This implies that 
b = ca forsomec eé R. 

(ii) We know that a ~ b if and only if a/b and b/a. The result follows from (1). 

(iii) a is a proper divisor of b if and only if a is nonunit divisor of b and b does 
not divide a. This is equivalent to say that Ra #4 R, Ra & Rb, and Rb C Ra. 

(iv) Suppose that m is a l.c.m of a and b. Then, a/m and b/m and so Rm © 
Ra() Rb. Further, if x € Raf) Rb, then a/x and b/x. Since m is l.c.m, m/x. 
But, then x € Rm. This shows that Rm = Ra() Rb. Conversely, suppose that 
Rm = Ra() Rb. Then, Rm C Ra and Rm C Rb, and hence, a/m and b/m. Also 
if a/n and b/n, thenn € Ra() Rb = Rm. But, then m/n. This shows that m is 
l.c.m of a and b. 

(v) Suppose that d is a g.c.d of a and b. Then, d/a and d/b. Hence, d divides 
xa+ yb forallx, y € R. This means that Ra + Rb C Rd. Next,if Ra + Rb C Rd’, 
then Ra C Rd’ and Rb C Ra". This means that d’/a and d’/b. Since d is g.c.d of a 
and b, d'/d andso Ra + RbC Rd C Rd’. 

Conversely, let Rd be the smallest principal ideal containing Ra + Rb. Then, 
Ra © Rd and Rb C Rad. This means that d/a and d/b. Further, if d’/a and d'/b. 
Then, Ra C Rd’, and Rb C Rd’. But, then Ra + Rb C Rd’. Since Rd is the 
smallest principal ideal containing Ra + Rb, Rd C Rad’. This means that d’/d, 
and so d is a g.c.d of a and b. tt 
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Remark 11.1.11 Clearly, Ra + Rb = Rd implies that d is a g.c.d of a and b. 
However, d is a g.c.d of a, and b does not imply that Ra + Rb = Rd: For 
example, consider the polynomial ring Z[X]. It is easy to see that 2 and X are 
irreducible elements of Z[X]. Hence, 1 is g.c.d of 2 and X. But Z[X]-2 + Z[X].X 
is the set of polynomials over Z whose constant terms are divisible by 2. Thus, 
Z[X]-2 + Z[X]-X AZ[X]-1. 


Exercises 
11.1.1 Find out the group of units of Z[,/—3]. 


11.1.2 Show that 2 and 1 + ./—3 are irreducible in Z[,/ —3] but they are not primes. 
Do they have |.c.m? 


11.1.3 Show that X is prime element in Z[X]. Show that the ideal generated by X 
is a prime ideal but it is not maximal. 


11.1.4 Let a,b € Z*. Show that a + bX is irreducible in Z[X] if and only if 
(a,b) = 1. 


11.1.5 Show that X? + 1 is irreducible in R[X], where R is the field of real numbers. 
Show further that aX? + bX + c is irreducible if and only if b? — 4ac < 0. 


11.1.6 Show that X? + @ is irreducible in Zp[X] if and only if ce = —-l. 


11.1.7 Show that every ideal in Z is a principal ideal. What are prime ideals of Z? 
What are maximal ideals of Z? 


11.1.8 Show that [ca, cb] ~ c[a, b] provided that both side exist. 
11.1.9 Show that [[a, b], c] ~ La, [b, c]] provided that both side exist. 


11.1.10 Find out the group of units of Z[./—7]. Show that 3 + ./—7 is irreducible. 
Is it prime? 


11.1.11 Show that X? + 1 is irreducible in Zp[X] if and only if p = 1(mod 4). 
11.1.12 Show that Z[,/—5] and Z[i] are not isomorphic. 


11.1.13 Consider the integral domain Z[w] = {a+ bw | a,b € Z}, where w 
is a primitive cube root of 1. Show that a + bw is a unit in Z[w] if and only if 
a? —ab+b* = 1. 


11.1.14 Find the group of units of Z[w]. 
11.1.15 Find the group of units of Z[,/—2]. 


11.1.16 Show that a+ bi is irreducible in Z[i] provided that a? +b? is an irreducible 
integer. 
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11.1.17 Show that if a? — ab + B? is irreducible integer, then a + bw is irreducible 
in Z[w]. 


11.1.18 Suppose that a* + 2b? is an irreducible integer. Show that a + b./—2 is an 
irreducible element of Z[,/—2]. 


11.1.19 Let a,b € Z. Show that a/b in Z if and only if a/b in Z[i]. 


11.1.20 Let a,b € Z. Show that g.c.d of a and b in Z is same as a g.c.d of a and b 
in Z[i]. 


11.1.21 Determine the integral points on the circle x7 + y* =p, and also the 
integral points on the ellipse x7 + 5y* = p for all primes p < 19. 


11.2 Principal Ideal Domains 


Definition 11.2.1 A commutative integral domain R with identity is called a 
principal ideal domain if every ideal of R is a principal ideal. If R is a princi- 
pal ideal domain, then in short we express it by saying that R is a P.I.D. 


Thus, a commutative integral domain with identity is a principal ideal domain if 
and only if every ideal of R is of the form Ra for some a € R. The ring Z of integers 
is a principal ideal domain, for any ideal (indeed, any subgroup) of Z is of the mZ for 
some integer m. Every field is trivially a principal ideal domain, for the only ideals 
are {0} and the field itself. A little later, we shall have more examples of principal 
ideal domains. 


Proposition 11.2.2 Greatest common divisor of any pair of nonzero elements in a 
PLD. exist. Further more, if d is ag.c.d ofaand bina P1I.D R, thend = ua + vb 
for some u,ve R. 


Proof Let R be a principal ideal domain and a, b be nonzero elements of R. Since 
R is a principal ideal domain, the ideal Ra + Rb is a principal ideal. Hence, 
Ra + Rb = Rd forsomed é€ R. From Proposition 11.1.10, d is the g.c.d of a and 
b. Sinced € Rd = Ra + Rb, d = uv + vb forsomeu,ve R. ft 


Corollary 11.2.3 In a principal ideal domain, every irreducible element is prime 
element. 


Proof Follows from Proposition 11.2.2 and Corollary 11.1.9. ft 


Thus, Z[./—5] is not a P.I.D. Observe that a subring of a P.I.D need not be a P.I.D, 
for example, the field C of complex numbers is a P.I.D whereas the subring Z[./—5] 
is not a P.I.D. However, 


Proposition 11.2.4 [fa homomorphic image (and hence, a difference ring) of a P.I.D 
is an integral domain, then it is also a PLD. 
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Proof Let f be a surjective homomorphism from a P.I.D R to an integral domain R’. 
Clearly, R’ is a commutative integral domain with identity. Let A be an ideal of R’. 
Then, f~'(A) is an ideal of R. Since R is a PI.D, f—'(A) is generated by a single 
element a (say). Then, A = f(f~'(A)) is generated by f(a). Thus, every ideal of 
R’ is a principal ideal. tt 


Proposition 11.2.5 Let R be a PI.D and p € R. Then the following conditions are 
equivalent. 

(i) p is an irreducible element. 

(ii) p is a prime element. 

(iii) Rp is a maximal ideal of R. 

(iv) R/Rp is a field. 


Proof Equivalence of (i) and (ii) follows from Proposition11.1.6, and 
Corollary 11.2.3. Equivalence of (iii) and (iv) follows from the fact that M is a 
maximal ideal if and only if R/M is a field. We prove the equivalence of (i) and (iii). 

Assume(i) Let p be an irreducible element, and A an ideal containing Rp. Suppose 
that A # Rp. Then, there exists an element a € A — Rp. Clearly, p does not divide 
a. Since p is irreducible, (p,a) = 1. From Proposition 11.2.2, 1 = xa + yp for 
some x, y € R. Since A is an ideal containing a and p, 1 € A. Hence, A = R. 
Thus, Rp is a maximal ideal. 

Assume that Rp is a maximal ideal and a/p. Then, Rp C Ra. Since Rp is a 
maximal ideal, Rp = Raor Ra = R. Hence, p ~ a ora is a unit. This shows 
that p is irreducible. ft 


Theorem 11.2.6 The polynomial ring R[X] is a PLD if and only if R is a field. 


Proof Suppose that R is a field and A an ideal of R[X]. We have to show that A is a 
principal ideal. If A = {0}, then there is nothing to do. Suppose that A 4 {0}. Let 
S = {deg(f(X)) | f(X) € A — {0}}. Clearly, S is a nonempty subset of NL {0}. 
By the well-ordering principle in N (J {0}, S has the least element deg (h(X)) (say), 
where h(X) € A — {0}. We show that A = R[X]h(X). Clearly, R[X]h(X) C A. 
Let f(X) € A. By the division algorithm in R[X], there exist g(X), r(X) € R[X] 
such that 


F(X) = q(X)h(X) + r(X), 


where r(X) = 0 or else deg(r(X)) < deg(h(X)). Since f(X), h(X) € A 
and A is an ideal, r(X) = f(X) — q(X)h(X) belongs to A. If r(X) 4 0, then 
deg(r(X)) € S and deg(r(X)) < deg(h(X)). This is a contradiction to the choice 
of h(X). Hence, r(X) = 0, and f(X) = q(X)h(X) € R[X]Jh(X). 

Conversely, suppose that R[X] is a P.I.D. We first observe that X is irreducible 
in R[X]. Suppose that X = f(X)g(X). Comparing the degrees, we may assume 
that deg(f(X)) = 0 and deg(g(X)) = 1. Suppose that f(X) = wu © R and 
g(X) = vX + w. Comparing the coefficient of X, we see that uv = 1, and 
so f(X) = wisaunit. Now, let a € R*. Since R[X] is assumed to be a P.I.D, a 
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and X will have a g.c.d. Since X is irreducible (a, X) = 1. By Proposition 11.2.2, 
1 = af(X) + Xg(X) for some f(X), g(X) € R[X]. Equating the constant term 
of both sides, we get that 1 = ab for some b € R. This shows that a is invertible, 
and so R is a field. tt 


Example 11.2.7 Z[X] is not a P.I.D, for Z is not a field. R[X] is a P.IL.D, where R 
denotes the field of real numbers. 


Proposition 11.2.8 Let F be a field. Then, the following hold: 

(i) Any polynomial aX + b of degree 1 is irreducible in F(X). 

(ii) Let f (X) be a polynomial in F[X] of degree greater than I which has a root in 
F.. Then, f (X) is reducible in F(X] 

(iii) A polynomial f (X) in F[X] of degree 2 or degree 3 is irreducible if and only if 
it has no root in F. 


Proof (i) Since the units of F[X] are precisely those of F' (Proposition 7.6.7), a 
polynomial aX + b of degree | can not be a unit in F[X]. Again, since all nonunit 
elements of F[X] are of positive degrees, and a polynomial of degree 1 can not be 
expressed as product of polynomials of positive degrees, it follows that any polyno- 
mial of degree | is irreducible. 

(11) Let f (X) be a polynomial of degree at least 2, and let a be aroot of f(X). By the 
factor theorem (Corollary 7.6.17), it follows that f(X) = (X — a)g(x) for some 
polynomial g(X). Comparing the degrees, we see that g(X) is of positive degree, 
and so it is not a unit. It follows that f(X) is product of two nonunits. 

(iii) Let f (X) be a polynomial of degree 2 or 3. Suppose that f(X) = g(X)h(X), 
where g(X) and A(X) are nonconstant polynomials. Comparing the degrees of both 
sides, we see that degg(X) = | or degh(X) = 1. Suppose that degg(X) = 1. Then, 
g(X) = aX + bwitha ¥ 0. But, then —b/a is a root of g(X), and hence, it is 
also a root of f(X). This shows that f(X) is reducible if and only if it has a root in 
F. tt 


Example 11.2.9 X? + 1 isan irreducible element in R[X], for it has no root in R. 
In general, a quadratic polynomial aX? + bX + c is an irreducible element in 
R[X] if and only if b> — 4ac < 0. Since every cubic polynomial over R has a 
root in R, it is reducible. Fundamental theorem of Algebra ensures that a polynomial 
with complex coefficients has all its roots in the field C of complex numbers. In 
particular, the irreducible elements in C[X] are precisely the linear polynomials. 
Further, it is evident that the conjugate of a complex root of a polynomial with 
real coefficients is again a root of the polynomial (this is because f(Z) = f(z) 
for all real polynomials f(X)). Again, for any complex number z, the polynomial 
(X — z)(X — Z) = X? — (¢ + 2%) 4+ 2zZisareal polynomial. In turn, it follows 
that any polynomial in R[X] of degree greater than 2 is reducible in R[X]. 


Let F be a field and f(X) be an irreducible polynomial in F[X] of degree n. 
Let F[X] (x) denotes the set of polynomials of degrees less than n together with 
0 (equivalently, the set of remainders obtained when polynomials are divided by 
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J (X)). The addition of polynomials induces the addition in FLX] ¢:x). Further, define 
a multiplication e in F[X] ¢:x) by defining g(X) eh(X) to be the remainder obtained 
when the usual product g(X)h(X) of g(X) and h(X) is divided by f(X). 


Proposition 11.2.10 The triple (F[X] f(x), +, ©) is a field. 


Proof Clearly, (F[X] (x), +) is an abelian group. By the definition r(X) e (s(X) + 
t(X)) is the remainder obtained when r(X)(s(X) + f(X)) is divided by f(X). 
Further, the remainder obtained when the sum of the remainders of r(X)s(X) and that 
of r(X)t(X) is divided by f (X) is same as the remainder obtained when r(X)(s(X) + 
t(X)) is divided by f(X). Thus, r(X) e (s(X) + 1f(X)) = r(X)es(X) + 
r(X) et(X)). Again, since r(X)s(X) = s(X)r(X), it follows that r(X) es5(X) = 
s(X) e r(X). The constant polynomial | is the identity element with respect to e. 
Finally, we need to show that any non zero element in F[X] ¢(x) has a inverse with 
respect to the multiplication e. Let r(X) be a nonzero element in FLX] ¢;x). Since 
deg r(X) < deg f(X) and f(X) is irreducible, a greatest common divisor of r(X) 
and f(X) is 1. Since F[X] is a P.I.D. (Theorem 11.2.6), by Proposition 11.2.2, there 
exist polynomials u(X) and v(X) in F[X] such that 


1 = u(X)r(X) + v(X) f(X). 


It is evident that if we divide u(X)r(X) by f(X), the remainder obtained is 1. Let 
s(X) be the remainder obtained when u(X) is divided by f(X). Then, s(X) € 
FX] ¢cxy and the remainder obtained when s(X)r(X) is divided by f(X) is 1. Thus, 
s(X)er(X) = 1. This shows that (F[X] x), +, @) isa field. tt 


Corollary 11.2.11 The field (F[X] (x), +, ©) is isomorphic to the quotient field 
F[X]/F[X]f(X). Indeed, the map 7 from F(X] f(x) to F[X]/F[X]f(X) given by 
nir(X)) = r(X) + FLX] f(X) is an isomorphism. 


Proof Let g(X) + F[X]f(X) be an arbitrary element of F[X]/F[X]f(X), where 
g(X) is a polynomial in F[X]. Let r(X) be the remainder obtained when g(X) is 
divided by f(X). Then, g(X) — r(X) is a multiple of f(X) and so it belongs to 
F[X]f (X). Hence, g(X) + FLX] f(X) = r(X) + FLX] f(X) = n(r(X)). This 
shows that 7) is surjective map. Again, let (X) and s(X) are polynomials in F[X] ¢:x) 
such that n(r(X)) = (s(X)). Then, r(X) — s(X) € F[X]f(X). Since r(X) and 
5(X) are of degree less than that of f(X), it follows that r(X) = s(X). This shows 
that 7 is injective. Also, n(r(X)) + n(s(X)) = (r(X) + FLX] f(X)) + (s(X) + 
FLX] f(X)) = (r(X) + s(X)) + FLX] F(X) = n(r(X) + s(X)). Finally, since 
r(X) es(X) is the remainder obtained when r(X)s(X) is divided by f (X), it follows 
that n(r(X)es(X)) = (r(X)es(X)) + FLX] F(X) = r(X)s(X) + FIX] F(X) = 
(r(X) + FLX] f(X)) G(X) + FLX F(X) = nir(X))n(s(X)). This shows that 
7 1s a bijective homomorphism. tt 


Example 11.2.12 Since X 2 4 | has no root in R, it follows that it is irreducible in 
R[X]. Consider R[X]x24; = {a + bX | a, b € R}. The addition + and the 
multiplication e in R[X],2.; as described in the above proposition are given by 
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(a + bX) + (c + dX) = (a+ c) + (b + dx) 
and 
(a + bX)e(c + dX) =r4+ sx, 


where r + sX is the remainder obtained when (a + bX)(c + dX) is divided by 
X? + 1. Now, (a + bX)(c + dX) = bdX? + (ad + bc)X + ac. Dividing it 
by X? + 1, we get the remainder (bd — ac) + (ad + bc)X. Thus, 


(a + bX)e(c + dX) = (bd — ac) + (ad + bc)x. 


In particular, X 2 = —1.Itis easily observed that the map (a + bX) —> (a+ bi) 
defines an isomorphism from the field RLX]y2,, to the field C of complex numbers. 
In turn, the field R[X]y2,, can be called the field of complex numbers. 


Example 11.2.13 Consider the field Z) = {0, I} of residue classes modulo 2. 
Clearly, the polynomial X? + X + 1 in Z:[X] has no root in Zs, and as such it 
is irreducible in Z2[X]. For simplicity, let us denote Zp[X],2 , y , 7 by Fy. Thus, 
F, is the set of polynomials of degree at most 1 together with 0. More explicitly, 
F, = {0, 1, X, 1 + X}. The addition and the multiplication e are given by the 
following table: 


+ 0 1 xX {1+ Xx 
0 0 1 X |1+X 
1 1 Oo ji+ xX| Xx 
xX X [1+ xXx] 0 1 
14+ X/1+ x} x 1 0 
e (o] 1 X {1+ xX 

0 |0o) O 0 0 

1 |o} 1 xX {1+ xX 

xX |o| x Ji+ xX] 1 

1+ xjoji1 + x} 1 X 


Since X? + 1 has no root in Z3, as in the above example we have the following: 


Example 11.2.14 X* + Tis irreducible in Z3[ X], and the field Z3[X]y2,7 is a field 
of order 9. 


Remark 11.2.15 In the chapter on Galois theory in the vol 2 of the book, we shall see 
that for every prime p and every natural number 1, there is an irreducible polynomial 
of degree n in Z,[X], and so also a field of order p”. Indeed, we have a nice formula 
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to find the number of irreducible polynomials of degree n in Z,,[X]. All the fields of 
order p” are isomorphic. 


Exercises 


11.2.1 Show that in a P.ID, |.c.m of any pair of nonzero elements exist, and if m is 
l.c.m of a and b, then mua + mvb = ab (or equivalently v/a + u/a = 1/m) 
for some u, v in the PI.D. 


11.2.2 Show that X? + 1 is irreducible in Z3[X], and Z3[X]y2 7 isa field of order 
9. Describe its elements, and also the operations of addition and multiplication in 
Z3[X]x2 + 7- 


11.2.3 Show that X? + X? + Tis irreducible in Z2[X], and so is X? + X + 1. 
Describe the corresponding fields Z2[X] x3, y2,7 and Z2[X]x3,y,7. Show that they 
are isomorphic. 


11.2.4 Show that X? + X? + X + 1 is irreducible in Z3[X]. Describe the 
corresponding field which is of order 27. What are other irreducible polynomials of 
degree 3 in Z3[X]? 


11.2.5 Let f(X) be an irreducible polynomial in F,[X] of degree n, where F, is a 
field of order g. Show that F,[X]/Fy[X](f(X)) is a field of order q”. 


11.2.6 Show that X? + @ is irreducible in Z p|X], p an odd prime if and only if 
a) = —1. In this case, Z,[X]x24z is a field of order p. 


11.2.7 Show that X? + 10 is irreducible in Z pLX] if and only if the remainder 
obtained when the prime p is divided by 40 is different from 1, 3, 9, 13, 27, 31, 37 
and 39. Thus, it is irreducible, for example, in Z43[X]. 


11.2.8 Describe primes p for which X? + 15 is irreducible in Z,[X]. 


11.2.9 Show that X? + X + 1 is irreducible in R[X]. Describe the corresponding 
field. Exhibit an explicit isomorphism from this field to the field C of complex 
numbers. 


11.2.10 Show that CLX, Y]/C[X, Y](X? + Y? — 1) is a PLD. (assume that every 
polynomial over C has a root). Is this result true if we replace C by the field R of 
real numbers? 


11.2.11 Show that Q[X]/Q[X](X?43X?—8) isa field. Is R[X]/R[X](X?+3X?— 
8) a field? 


11.2.12 What can we say about union of a chain of P.I.Ds? Is it always a P.I.D? 


11.2.13 Let R be a PID. Let a,b € R*. Let d be a g.c.d and m a l.c.m of a 
and b. Show that Rm is an ideal of Ra, and Rb an ideal of Rd. Further show that 
Ra/Rm = Rd/Rb. 


11.2.14 Show that Z[./—5] is not a PID. 
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11.3. Euclidean Domains 


Definition 11.3.1 A pair (R, 5), where R is a commutative integral domain and 6 a 
map from R* to N () {0}, is called an Euclidean Domain, if givena,b € R, b £0, 
there exist g, r € R such that 


a=bq+r, 


where r = Oorelse d(r) < 6d(d). 


Example 11.3.2 (Z,| |), where | | is the absolute value map, is an Euclidean 
domain(follows from the division algorithm theorem). Further, (Z, 5), where (a) = 
a”, is another Euclidean domain structure on Z. Thus, on a commutative integral 
domain, there can be several Euclidean domain structures. 


Example 11.3.3 Let F bea field. Then, (F[X], deg) is an Euclidean domain (follows 
from the division algorithm in F[X]). (F[X], 5), where 6(f(X)) = 248) is 
also an Euclidean domain (This fact also follows from the division algorithm). 


Example 11.3.4 The pair (Z[i], 6), where Z[i] is the ring of Gaussian integers, and 
d(a+ bi) =|a+bi 7 = a’ +b’, is an Euclidean domain. 


Proof Letn € Zand m €N. Then, by the division algorithm, there exist g and r in 
Z such that 


n mq + r, 0<r<m,and so also 


n= mq+l1)+r—m,0<r < Mm. 


Clearly, | r |< 5 or|r—m |< . Thus, there exist integers uw and v such that 
m 
n= mu + es els 7. 


Letx = a+bi andy = c-+di be members of Z[i] with y ¢ 0. Then, 
| y |? = ce? +d? € N. From what we observed above, it follows that there exist 
integers a, 3, u, and v such that 


e+ d? 


act+tbd = a(c? +d’) + B, 0<| B I< (11.1) 


and 


2 a 
be —ad = u(c? +d’) + v, 0<|vl< = 


(11.2) 
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Thus, 
xy = (a+bi)(c—di) = (ac+bd)+(be-—ad)i = 
(at+uil(?+d7)+B+iv = (atui)yy+6+iv. 
Hence, 
(G+iv)y 
x= (a + ui)y + age 
Put 
(B+ivyy _ 
(c? + d*) 


Then, since.x, yanda+ui € Z[i], r € Z[i]. Alsod(r) = 4*%. From Eqs. (11.1) 


and (11.2), we see that d(r) < c? + d* = 6(y). This shows that (Z[i], 5) is an 
Euclidean domain. tt 


Example 11.3.5 Let w denotes a primitive cube root of unity. Thus, w 4 1 and 


w> = 1. Clearly, w? is also a primitive cube root of unity. We have = w*, and 


1 + w + w? = O.Consider the set Z[w] = {a+ bw | a,b € Z}. Then, Z[w] is 
a subring of the field C of complex numbers (verify). Thus, Z[w] is a commutative 
integral domain with identity. Define a map 6 from Z[w] to N LJ {0} by 


d(a+bw) = a —ab+h* = (a + bw)a + bu). 


(Note that (a+ bw) = a+ bw’, and so a* — ab+ b’ is always a nonnegative 
integer). Then, (Z[w], 6) is an Euclidean domain. 


Proof Let a + bw and c + dw 4 0 be members of Z[w]. As in the above example, 
we have integers a, 7, u, and v such that 


c+d*—cd 


ac+bd—ad = a(c? +d? —cd) + 8, where | 3 | < 


(11.3) 
and 


ce +d* —cd 


be —ad = u(c? +d? —cd) + v, where | v | < 5 


(11.4) 
Now, from Eqs. (11.3) and (11.4) we get 


(a+ bw)(c+dw) = (ac+bd—ad) + (be -—ad)w = 
(a+ uw)(c? + d* —cd) + B+ ww 
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Since c? +d? —cd = (c+ dw)(c + dw), we have 


Be teenie eae a ee 


+bw = (at >+dw) + ——— 
at bw (a + uw)(c + dw) or i c2 + d* —cd 


Take 


(B+ vw)(c + dw) 
r= . 
c2 +d? —cd 


Then,r = (a+bw) — (a+uw)(c + dw) belongs to Z[w]. Further, 


(P+v?—Bvi(e+d>—cd) KP +v-Bv 
(c? + d? — cd)? ~ (24 d2—cd" 


oir) = 


Using the in equalities in (11.3) and (11.4), we get that 


2 oo 
3(c7 +d cd) “ 


i Ct+d*—cd = 6(c+dw). 


o(r) < 


This shows that (Z[w], 6) is an Euclidean domain. tt 


Theorem 11.3.6 Let (R, 6) be an Euclidean domain. Then, R is a principal ideal 
domain. 


Proof We first show that every ideal of R is of the form Ra for some a € R. Let A 
be an ideal. If A = {0}, then A = R - 0, and there is nothing to do. Suppose that 
A & {0}. Consider the set S = {d(a) | a € A— {0}}. Clearly, S is a nonempty subset 
of N U{0}. By the well-ordering principle in N {0}, there exists a aj € A — {0} 
such that 6(ag) is the least element of S. We show that A = Rap. Since A is an ideal 
and ay € A, Rag C A. Letb € A. Since ap ¥ 0, there exist g, r € R such that 


b = qa, +7, 


wherer = Oorelse d(r) < d(a). Clearly,r = b—agq € A. Hence,r = 0, 
for otherwise 6(r) € S, and d(r) < 6(ao), a contradiction to the choice of ag. Thus, 
b = qd € Rag. This proves that every ideal of R is of the form Ra. It is sufficient, 
now, to show that R has identity. Since R is an ideal of R, it follows that R = Rao 
for some dao € R. Thus, there exists e € R such that ag = edgy. Note that ay 4 0, for 
R {0}. Since R is commutative, ea9 = dyp = age. Letb € R. Since R = Rao, 
there exists c € R such thatb = cao. Now, be = cape = cao = Db. Since R is 
commutative, e is the identity of R. tt 


Corollary 11.3.7 In an Euclidean domain, g.c.d exists, and every irreducible ele- 
ment is prime. 


Proof Since every Euclidean domain is a principal ideal domain, the result follows 
from Proposition 11.2.2 and Corollary 11.2.3. tt 
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Corollary 11.3.8 Z[i] and Z[w] are principal ideal domains. tt 


Example 11.3.9 Every field F is an Euclidean domain with respect to the 6 defined 
by 6(a) = 1 for all a ¥ 0 (verify). In particular, the field C of complex numbers is 
an Euclidean domain. Clearly, Z[X] is not an Euclidean domain, for it is not even a 
principal ideal domain. C[X] is an Euclidean domain. Thus, subring of an Euclidean 
domain need not be an Euclidean domain. 


Remark 11,3.10 Arithmetic properties of an Euclidean domain (R, 6) do not depend 
on a particular choice of 6. Some authors define an Euclidean domain to be a com- 
mutative integral domain for which such a 6 exists. 


Remark 11.3.1] There are principal ideal domains which are not Euclidean domains 
(see Euclidean Algorithms BAMS 1949, Vol 55). 


Example 11,3.12 Let (R, 6) be an Euclidean domain. Let a, b € R*. Then, we can 
find a greatest common divisor d of a and b and also elements u and v of R such that 
d = ua + vbas follows: Since (R, 6) is an Euclidean domain, there exist g,r € R 
such that 


a=bq+r, 
wherer = Oorelse d(r) < 6(b).Ifr = 0,thenb/aandb = 0-a + 1-bisa 
g.c.d of a and b. Suppose that r 4 0. The common divisors of a and b are same as 
those of b and r. Thus, 
(aby, T). 
Again, since (R, 5) is an Euclidean domain, there exist g; and r; such that 
b=qr+n, 
where r} = O orelse 6(r;) < d(r) < 6(b). Ifr; = 0, then 


(a,b) ~ (b,r)~r = la-—gqb 


and we are through. Suppose that r; # 0. Then again common divisors of b and r 
are same as those of r and r;. Thus, 


(a,b)~ ,r)~ ri), 


where rr) = —qir + b = —qia + (qiq + 1)b. Again, there exist gz and r2 
such that 


r= gri + ra, 
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where r2 = O orelse d(r2) < d(r1) < d(r) < 6(b). Ifr2 = O, then 
a,b)~On~@ny~n = —qga + (qq + Ib. 
If r. € 0, proceed as above. This process stops after finitely many steps giving us 
(a,b) ~ (byt) Gn) Yor ~ Gian) Yn = aa + Ud 


for some u;, v; € R. 


Example 11.3.13 Using the above algorithm and the division algorithm in Z[i] as 
explained in Example 11.3.4, we find g.c.d and l.c.m of 8 + 6i and 5 + 15i in Zi]. 
This illustrates a method to find a g.c.d and |.c.m of any two Gaussian integers. 

We observe that 100 = 6(8+6i7) < 6(5+15i) = 250. Now, (5+15i)(8—6i) = 
130 + 907. Further, 130 = 1 x 100 + 30and90 = 1 x 100 + (—10). Thus, 


(5 + 151)(8 — 61) = (1+i) x 100 + G0- 10i) = 
(1 + 1)(8 + 67)(8 — 67) + (30 — 102). 


Hence, 


(54+ 151) = 14+1)(84+ 61) + ze = (1+1)(8+ 61) + (33+4+/). 


Thus, the remainder obtained when 5 + 157 is divided by 8 + 67 is 3 +7, and g.c.d 
of 5 + 157 and 8 + 67 is same as that of 8 + 67 and 3 + 7. Again consider 


(8+ 61)3—i) = 304+ 10i = G+i)x 10 = 3+1)3+i1)GB-i). 


Hence, 8+ 6i = (3+i7)(3+/7). This shows that g.c.d of 8+ 6i and3+i7is3+i. 
Thus, g.c.d of 5 + 157 and 8 + 6i is 3 + i, and also 


3+7 = 1-(54+ 157) + (-1—i1)(8+ 61). 


Further, l.c.m of 5 + 15i and 8 + 67 is SEEPSE = (5 4 151)(3 +i) = SOI. 
Exercises 


11.3.1 Show that (Z[./—2], 6), where 6(a + b\/—2) = a? + 2b? is an Euclidean 
domain. 


11.3.2 Find g.c.d and l|.c.m of the following pairs in Z[i]. Also express g.c.d as linear 
sum of the corresponding pairs. 

@)3+4i, 443i. 

(ii) 4+ 27, 3+ 4). 
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11.3.3 Find g.c.d and |.c.m of the following pairs in Z[w]. Also express g.c.d as 
linear sum of the corresponding pairs. 

(i)2+3w, 6+ 8w. 

Gi) 5+ 1llw, 2+ 3w. 


11.3.4 Find g.c.d and I.c.m of the following pairs in Z[./—2]. 
@5+7/-2, 2+ /-2. 
(i) 3+ /-2, 443-2. 


11.3.5 Generalize the Chinese remainder theorem of Chap.6 to principal ideal 
domains and Euclidean domains. 

7 ZAi] ae 
11.3.6 Determine the field Zrd+i" What is its order? 
11.3.7 Let p be a prime element of Z, which is not a prime element of Z[i]. Show 
that p is product of two complex conjugates in Z[i]. Deduce that if a prime element 
of Z is not a prime element of Z[i], then it is sum of squares of two integers. 


11.3.8 Let a + bi be a prime element of Z[i]. Show that a* + b? is prime element 
of Z, or it is a square of a prime element of Z. 


11.3.9 Let p € Z be a prime integer. Suppose that p is prime in Z[i]. Show that 
Pp = 3(mod 4). 


11.3.10 Let p be a prime element of Z which is not prime element of Z[i]. Show 
that (>) = 1. Deduce that p = 1(mod 4). 


11.4 Chinese Remainder Theorem in Rings 


In this section, we generalize the Chinese remainder theorem of Chap. 6 to arbitrary 
commutative rings with identities. 


Definition 11.4.1 A pair (A, B) of ideals of R is said to be co - maximal if A + 
B= R. 


Example 11.4.2 Ina principal ideal domain, a pair (Ra, Rb) is co-maximal if and 
only if Ra + Rb = R. This is equivalent to say that (a, b) ~ 1. 


Theorem 11.4.3 (Chinese Remainder Theorem). Let R be a commutative ring with 
identity. Let {A,, Az2,..., Ay} be a set of pairwise co-maximal ideals of R. Let 
a1, d2,...a, be members of R. Then, there exists x € R such that x — a; € A; 
for alli. 


Proof We first prove the result forr = 2. Let {A;, Az} be a pair of co-maximal 
ideals. Let aj, a. € R. Since Ay + Ar = R, 1 = uy + up forsome uw, € Aj 
and uy € A>. Take x = dou, + aju2. Then, x —a; = dou, + ay(u.—1) = 
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anu; + ay(—u;) = (a2 —a))u, € Aj. Similarly, x — a2 € Az. Assume that 
the result is true for r. Let {A;, Az,..., A-, A-+1} be a set of pairwise co-maximal 
ideals. Let a), do, ..., 4, d,41 be elements of R. By the induction assumption, there 


exists a y € R such that y — a; € A; for alli <r. We have 
A\ + Ar+1 = R, A2 + Ar-+1 => R,..., A; + Ar+t => R. 


Multiplying the above equations, we get that 


Arti + (Ai (| A2() (A) = 8. 


Put (A; (} A2()---() Ar) = B. Then, (A,+1, B) is a pair of co-maximal ideals. 
From the case r = 2 (already proved), there exists an element x € R such that 
X — a-4, € A,y; andx — y € B. But, thenx —a; = (x—y) + (vy—a;) belongs 
to A; for alli <r. tt 


Corollary 11.4.4 Let R be a principal ideal domain. Let {m,, m2, ...,m,} be a set 
of pairwise co-prime elements of R. Let ay, dz, ..., a; be elements of R. Then, there 
exists an element x € R such that 

x =a;(mod m;) for alli <r. 


Proof Since R is a principal ideal domain, 


(m;,mj;) ~ 1 ifand only if Rm; + Rm; = R-1 = R. 


Thus, {Rm,, Rm, ..., Rm,} 1s a set of pairwise co-maximal ideals of R. From the 
Chinese remainder theorem, there exists ax € R such that x — a; € Rm; for all 7. 
This means that x = a;(mod m;) for alli < r. tt 


Corollary 11.4.5 Let R be a commutative ring with identity. Let {A,, A2,..., A,r} 
be a set of pairwise co-maximal ideals of R. Let A = A, (| A2(\---(\ Ay. Then, 
R/A is isomorphic to R/A, x R/Az x +--+ x R/A,;. 


Proof Define a map f from R to R/A; x R/A2 x---x R/A, by 
f(a) = (a+ Aj,a+A2,...,a+A,). 
It is easy to see that f is aring homomorphism. Let (a; + A;, d2+A2,...,4,+A,) 
be an arbitrary element of R/A; x R/A2 x --- x R/A,. By the Chinese remainder 
theorem, there exists x € R such that x — a; € Aj; for all i. This means that 
x+A; = a; + A; for alli. Thus, f is surjective. Now, 
ker f = {x ER|x+A, = A, Vi} = (hE R|x€ A, Vi} = (4: 


The result follows from the fundamental theorem of homomorphism. tt 
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Corollary 11.4.6 Let R be a principal ideal domain. Let {m,,mz,...,m,} be a 
set of pairwise co-prime elements of R. Letm = mymz2---m,. Then, R/Rm is 
isomorphic to R/Rm, x R/Rm2z x --- x R/Rm,. 


Proof The result follows, if we observe that {Rm,, Rm2,..., Rm,} is a set of pair- 
wise co-maximal ideals and Rm = Rm, () Rm2()\ ---() Rm,. tt 


Remark 11.4.7 Let A and B be ideals of a commutative ring R with identity. Recall 
that the ideal generated by the set {ab | a € Aand b € B}is denoted by AB. Thus, 
AB C Af) B. In general, AB need not be A () B. For example, (4Z)(6Z) = 24Z 
where as 4Z,(\6Z = 12Z. However, if {Ai, A2,..., A,} is a set of pairwise co- 
maximal ideals of R, then AyA2---A, = Aj) A2()---() Ar. The proof of this 
fact is by induction on r. Suppose that r = 2. Let (Aj, Az) be a pair of co-maximal 
ideals. Then, A} + A. = R, and there is an element uv; € A, and an element 
ug € Ag such thatu; + uy = 1. Letx € Aj) Ao. Then,x = xu; + xu2 € 
A\ Ao, for xu,,xu2 € A Az. This shows that A;A2 = Aj {] A>. Assume that 
the result is true for r. Let {A;, Ao,..., A-+1} be a set of pair wise co-maximal 
ideals of R. By the induction hypothesis, A;A2--- A, = A,{)A2{}---{) Ar. 
As already observed in the proof of the Theorem 9.4.3, (A; A2---A,, A,41) is a 
pair of co-maximal ideals. Hence, A, A2--- A-A;41 = (Al A2°:- Ar) (Ar = 
Ai ()A21)--- Ar 1 Arqi- 


11.5 Unique Factorization Domain (U.F.D) 


Definition 11.5.1 A commutative integral domain R with identity is called a 
unique factorization domain (U.F.D) if the following conditions are satisfied. 

(i) Every nonzero nonunit element of R can be expressed as a product of irreducible 
elements of R, 
and 

(ii) the representation of any nonzero nonunit element a as product of irreducible 
elements is unique in the sense that if 


a = Pipr Pr = Pi Pos Po 


where p; and p’, are irreducible elements of R, thenr = s, and there is a permutation 
o of {1,2,..., r} such that pj ~ Poi) for alli. A unique factorization domain is also 
termed as Gaussian domain. 


Example 11.5.2. ZisaU.ED (follows from the fundamental theorem of arithmetic). 


Theorem 11.5.3 Let R be a commutative integral domain with identity. Then, the 
following conditions are equivalent. 
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TI, Risavu.FD. 

2. (i) Given a sequence a), 42, ..., An, Anti, .. of elements of R such that an41/Qn 
for alln €N, there exists ano € N such that a, ~ a, for alln > no. 

(ii) Greatest common divisors exist in R in the sense that any two nonzero elements 
of R have a g.c.d. 

3. (i) Same as 2(i). 


(ii) Every irreducible element is a prime element. 


Proof 1 => 2. Suppose that R is a unique factorization domain. Let 
Q1,42,+++54n, 4n+15--- 


be a sequence of elements of R such that a,41/a, for all n. Suppose that there is no 
m such that a, ~ a, for alln > m, then no a, is a unit (divisors of units are only 
units), and we can extract a subsequence 


G1, ny, Ang, ++ +5 An,» An, yis ++ 


of the given sequence such that a,,,, is a proper divisor of a, for all r. Since ay isa 
nonunit, and R is a unique factorization domain, 


a) = Pip2°** Pr, 


where each p; is irreducible element of R. Since a,, is a proper divisor of a}, a; = 
Gn, bn,, Where ay, and b,, are nonunits. Since a,, is a proper divisor of dy,, Gn, = 
An,bn,, Where a,, and b,, are again nonunits. But, then 


ay = An, Dn, Dn, - 
Proceeding inductively, in this way, we get 
aq = An, Pn, On,_, bee Dn, , 


where a,, is a nonunit and each by, is a nonunit. Since R is a unique factorisation 
domain a, and each b,,, are products of irreducible elements of R. But, then a, will 
also be expressible as product of more that r irreducible elements of R. This is a 
contradiction to the supposition that R is a unique factorization domain. Thus, 1 
implies 2(i). 

Next, assuming 1, we show that g.c.d of any two nonzero elements exist. Let a, b 
be nonzero elements of R. If a is a unit, then a divides b, and so a is a g.c.d of a and 
b. Suppose that neither a nor b is a unit. We can write 


4 Qa) a2 ay, 
az uP; Po 11D, 


408 11 Arithmetic in Rings 


and 


By, 6 Bo 
b = p\' py’ ++: Py’, 
where uw is a unit a;, 3; > O for all i and {p, po,..., p,} is a set of pairwise 
nonassociate irreducible elements of R. Take 


—_ M1 ba Ly 
d = P; P> 11 Dp. ; 


where 4; = min(a;, 3;). We show that d is a g.c.d of a and b. Clearly, d/a and 
d/b. Suppose that d'/a and d'/b. If d' is a unit, then it divides d. Suppose that d’ is a 
nonunit. Suppose that p is an irreducible element which divides d’. Then, p divides 
a, and also p divides b. By the uniqueness of the factorization, p ~ p; for some i. 
Thus, we can write 


— 61 ..02 5; 
d= vpy' Pr ++: Py’ 
where v is a unit. Since d’/a, 


Qy a Q, On 


59 : 
a= Up; Po Dp = Up; py -->pre. 


for some c. Suppose that 6; > a). Then 


a2 on —a a2 


up» +++ per = Up; Po es perce, 


By the uniqueness of the factorization, p; ~ pj; for some j > 2. This is a con- 
tradiction to the supposition that {p1, p2,..., p-} is a set of pairwise nonassociate 
irreducible elements of R. Hence, 6; < a,. Similarly 6; < a; and 6; < (3; for alli. 
Now, it is clear that d’/d. Hence, d is a g.c.d of a and b. 

2 => 3. Assume 2. 3(i) is same as 2(i). Again, since g.c.d of any two nonzero 
elements exist, it follows (Corollary 11.1.9) that every irreducible element is prime. 
Thus, 2 => 3. 

3 => 1. Assume 3. Using 3(i), we show that every nonzero nonunit element of R 
can be expressed as product of irreducible elements of R. Let a be a nonzero nonunit 
element of R. We first show that a has an irreducible factor. If a is irreducible, then 
there is nothing to do. If not, a = a,b, where a, is a proper divisor of a. If a, is 
irreducible, we get an irreducible factor of a. If not, a; = azb2, where a is a proper 
divisor of a,. If az is irreducible, we get an irreducible factor of a;, and so of a. If 
not, proceed. This process stops after finitely many steps giving us an irreducible 
factor p, of a, for otherwise we shall get a sequence 


Q|,42,---,4n, 4n+1,--- 


of elements of R such that each a;+, is a proper divisor of a;. This is a contradiction 
to 3(i). Let 
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a= pic. 


If c; is irreducible, we are through. If not, as before, there is an irreducible factor p2 
of c,. Thus, 


a = Pip2c2, 


where cp is a proper divisor of c . If cz is irreducible, we are through. If not, proceed. 
This process also stops after finitely many steps giving us a factorization 


—— PiP2°°+ Pr 


of a as product of irreducible elements of R, for otherwise we shall arrive at a 
sequence 


C1, C2,---,Cn, Cnht+l> se 


of elements of R such that each c,+) is a proper divisor of c, (this is a contradiction 
to 3(i)). 
Finally, we prove the uniqueness of the factorization. Let 


a = Pipr-** Pr = PiPo-** Py 


where p; and P’, are irreducible elements of R. Since every irreducible element is 
supposed to be prime, each p; and p’. are prime elements. Now, p; dividesa = 
P\P2°*: Py. Hence, p; divides p’, for some j. After rearranging, we may assume 
that p; divides p{. Since p' is irreducible, and pj is not a unit (being irreducible), 
Pi ~ p}. Suppose that p; = up, where wu is a unit. Then 


P1P2*** Pr = UPI Py: +: Pi. 


Canceling p1, we get that 


P2P3°°* Pr = UPyP3*** Py 


As above p2 ~ up ~ p. Proceed inductively. In this process, {p1, p2,..., Pr} 
and {p', p5,.-., p,} both will exhaust simultaneously, for otherwise we shall arrive 
at a product of irreducible elements equal to identity. This is not possible, for an 
irreducible element can not be a unit. This ensures that rr =  s, and after some 
rearrangement p; ~ p; for all i. ft 


Corollary 11.5.4 Every principal ideal domain is a unique factorization domain. 


Proof Let R be a principal ideal domain. By Proposition 11.2.2, g.c.d. of any two 
nonzero elements in R exists. Thus, it is sufficient to prove the condition 2(i) of the 
above theorem. Let 
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Q|,42,.--,4n, 4n+1,--- 


be a sequence of elements of R such that a,,)/a, for all n. Then, by Proposi- 
tion 11.1.10, we get an ascending chain 


Ray © Rag © +++ Ray © Ragyi C++: 


of principal ideals of R. Let 


A= Jee Ra,. 


Then A, being union of ascending chain of ideals, is an ideal. Since R is a principal 
ideal domain, A = Rd forsomed € R.Now,d € Rd = A = Ws Ra,. Hence, 
d € Ra,, for some no € N. But, then 


Rd © Ray CJ” Ray = A = Ra. 


Hence, 
CO 
Ra, = A= U Ray. 
This implies that Ra, = Ra,, for alln > no. By Proposition 11.1.10, a, ~ a,, for 
all n > no. This completes the proof. ft 


Since every Euclidean domain is a principal ideal domain, we have, 
Corollary 11.5.5 Every Euclidean domain is a unique factorization domain. tt 


Example 11.5.6 It will follow soon (Gauss theorem) that Z[X] is a U.F.D. Clearly, 
then Z[X] is not a P.I.D for Z is not a field. Thus, a U.ED need not be a P.I.D. 


Example 11.5.7 Every field is a U.F.D. In particular, the field C of complex numbers 
isa U.ED, whereas the subring Z[,/—5] is not a U.E.D for 3 is irreducible in Z[./—5], 
but it is not prime. Thus, subring of a U.E.D need not be a U.E.D. 


Example 11.5.8 A homomorphic image of a U.E.D need not be a U.E.D (even if it 
is an integral domain): For example, Z[X] is a U.E.D, the map f(X) ~» f(./—5) is 
a surjective homomorphism from Z[X] to Z[./—5] but, Z[./—5] is not a U-.ED. 


Example 11.5.9 Ina principal ideal domain, we noticed that if d is g.c.d of a and b, 
then there exist u, v € Rsuchthatd = ua-+ vb. This result is not true in a U.F.D. 
For example, (2, X) ~ 1 but 2f(X) + Xg(X) can never be 1. 


Gauss Theorem 
Our next aim is to prove the following theorem due to Gauss. 
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Theorem 11.5.10 (Gauss) /f R is a unique factorization domain, then R[X] is also 
a unique factorization domain. 


To prove this result, we need some concepts and results. 

Let R be a unique factorization domain. Let f(X) € R[X], f(X) £0.A g.c.d 
of nonzero coefficients of powers of X in f(X) is called a content of f(X), and it 
is denoted by c(f(X)). Thus, a content of a polynomial is unique upto associates. 
Integers 2 and —2 are contents of the polynomial 2+ 10X* + 12X* in Z[X]. Gaussian 
integers 1 + i, —1 —i, 1 —i, and — 1 +i are contents of the polynomial 2 + (1 + 
i)X + 2iX? in Z[i][X] (prove it). 

If f (X) and g(X) are associates in R[X], then they differ by a unit in R[X]. Since 
units of R[X] are those of R, f(X) = ug(X) for some unit u of R. This shows that 
c(f (X)) ~ c(g(X)). Thus, polynomials which are associates have same contents. 
Nonassociate polynomials may also have same contents for example 2 + 10X7 and 
2+ 10X? + 12X* have same contents but, they are not associates. 

A nonzero polynomial f (X) is said to be a primitive polynomial if c(f(X)) ~ 1. 

Thus, 3 + 2X + 5X? is a primitive polynomial in Z[X]. 

Let f(X) 4 0 be a polynomial in R[X]. Then 


I(X) = c(f(X)ACO), 
where f,(X) is a primitive polynomial in R[X]. Also if 
S(X) = afi (X), 
where f|(X) is primitive, then c(f(X)) ~ a. 


Lemma 11.5.11 (Gauss) Let R be a unique factorization domain. Then, the product 
of any two primitive polynomials in R[X] is a primitive polynomial. 


Proof Let 
FR) = a bh ayk + a oe he ak” 
and 
g(X) = by + BX + bX + 0+) + By X”™ 
be two primitive polynomials. Then, we have to show that 
F(®)g(X) = co + er + cK? + ee + Cam X"™, 
where 


cy = Yi4j-raibj, r=1,2,...,n-+m 
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is a primitive polynomial. It is sufficient to show that no prime element of R divides 
each c,. Let p be a prime element of R. Since f(X) is primitive, p can not divide 
each a;. Thus, there exists ig such that p/a;, for all k < ip, and p does not divide 
dj,. Similarly, since g(X) is also a primitive polynomial, there exists a jo such that 
p/b; for alll < jo, and p does not divide b;,. Consider 


Cigt in = VitjainrjoUDj --> - (11.5) 


Ifi+ jy = io+ jo, i A io,and j A jo, theni < ig or j < jo. Hence, each 
term under summation in the right hand side of the Eq. (11.5) is divisible by p except 
Gy D jo Which is not divisible by p. This shows that p does not divide cj, j,, and hence, 
Ff (X)g(X) is a primitive polynomial. ft 


Corollary 11.5.12 Let f(X) and g(X) be nonzero polynomials in R[X], where R 
is a U.F-D. Then 


c(f(X)g(X)) ~ c(f (X))e(g(X)). 


Proof f(X) = c(f(X))fi(X) and g(X) = c(g(X))gi(X), where f)(X) and 
gi(X) are primitive polynomials. Further, 


IMEX) = c(f(X)e(g(X) fis), 


where, by Gauss Lemma, /; (X)g (X) is primitive. It follows that c(f (X))c(g(X)) 
is acontent of f(X)g(X). t 


Lemma 11.5.13 Every irreducible element of R is also an irreducible element of 
R[X]. 


Proof Let p be an irreducible element of R. Suppose that p = f(X)g(X), where 
F(X), g(X) € R[X]. Comparing the degrees, we find that deg(f(X)) = 0 = 
deg(g(X)). Thus, f(X), g(X) € R. Since p is irreducible in R, f(X) or g(X) isa 
unit in R. Since units of R are also units of R[X], the result follows. tt 


Lemma 11.5.14 Let R be a unique factorization domain, and F be its field of frac- 
tions. Let f (X) and g(X) be primitive polynomials in R[X] which are associates in 
F[X]. Then f (X) and g(X) are associates in R[X]. 


Proof We know that the units of FX] are those of F’. Thus, the units of F |X] are pre- 
cisely the nonzero elements of F’. Let f(X) and g(X) be two primitive polynomials 
in R[X] which are associates in F[X]. Then, f(X) = ug(X), where u is a nonzero 
element of F. Since F is the field of fractions of R, u = ab~! for somea,b € R*. 
Hence, f(X) = ab~'g(X). In turn, bf(X) = ag(X) for some a,b € R*. Since 
Ff (X) and g(X) are primitive in R[X], comparing the contents we find that b ~ a in 
R, and so f(X) and g(X) differ by a unit of R. Thus, f(X) ~ g(X) in R[X]. tt 


Lemma 11.5.15 Let f(X) be an irreducible element of R|X] of positive degree. 
Then, f (X) is irreducible in F(X], where F is the field of fractions of R. 
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Proof Suppose that f(X) is an irreducible element of R[X] of positive degree. 
Then, it is a primitive polynomial in R[X], for otherwise a content of f(X) would 
be a proper divisor of f(X). Suppose that it is reducible in F[X]. Then, f(X) = 
g(X)h(X), where g(X) and h(X) are polynomials in F[X] of positive degrees (for 
nonzero elements of F are units of F[X]). Suppose that 


g(X) = up + my X + ugX? +--+. + u,X" 
and 
h(X) = vo +t vuyX + VX? St. dace: ef Vin X", 


where u; and v; are members of F. If b is l.c.m of the denominators of nonzero 
coefficients of powers of X in g(X), then there is an element a € R such that 


g(X) = ab“! g\(X), 


where g)(X) is a primitive polynomial in R[X] of same degree as that of g(X). 
Similarly, 


h(X) = cd~'hj(X), 
where c,d € R* and h;(X) is a primitive polynomial in R[X]. Thus, 
f(X) = ac(bd)'g1(X)hi(X) 
or 
bdf(X) = acg)(X)hi(X). 
By the Gauss lemma, g)(X)h (X) is a primitive polynomial, and since f(X) (being 


irreducible element of R[X] of positive degree) is primitive, comparing the contents 
we obtain that 


bd ~ ac and f(X) ~ g\(X)hy(X) in R[X]. 


Since deg(gi(X)) = deg(g(X)) > Oanddeg(h(X)) = deg(h(X)) > 0, gi(X) 
and h,(X) are proper divisors of f(X) in R[X]. But, then f(X) would become 
reducible in R[X]. tt 


Proof of the Gauss theorem. We first show that every nonzero nonunit element 
F(X) of R[X] can be written as product of irreducible elements of R[X]. This, 
we prove by the induction on degree of f(X). If deg(f(X)) = 0, then f(X) € 
R. Since R is a U.ED, f(X) is a product of irreducible elements of R. By the 
Lemma 11.5.13, every irreducible element of R is also irreducible in R[X], and 
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hence, in this case f (X) is product of irreducible elements of R[X]. Suppose that the 
assertion is true for all those polynomials whose degree is less than the deg(f(X)), 
where deg(f(X)) => 1. Now, f(X) = c(f(X)) fi (X), where f; (X) is a primitive 
polynomial of degree same as that of f (X). If c(f (X)) is a nonunit, it is expressible 
as product of irreducible elements of R, and so of R[X]. Hence, it is sufficient to show 
that f; (X) is product of irreducible elements of R[X]. If f; (X) is irreducible, nothing 
todo. Suppose not. Then, f;(X) = g(X)h(X), where neither g(X) nor h(X) is aunit. 
Since f|(X) is a primitive polynomial, deg(g(X)) > Oanddeg(h(X)) > O. But, 
then deg(g(X)) < deg(fi(X)) = deg(f(X)) anddeg(hy(X)) < deg(fi(X)) = 
deg(f (X)). By the induction hypothesis, g(X) and h(X) are products of irreducible 
elements of R[X]. But, then f;(X) is also expressible as product of irreducible 
elements of R[X]. 
Now, we prove the uniqueness of the factorization of f(X). Suppose that 


F(X) = pipr-s PrA( XO) f(X)-- f(X) = Pips PPACO KX A,X, 


where p;, P; are irreducible elements of R, and f,(X) and f/(X) are irreducible ele- 
ments of RX] of positive degrees. Clearly, then f,(X) and f/(X) are primitive poly- 
nomials of R[X]. By the Gauss lemma, /\(X) fo(X) +--+ fo(X) and f/(X) fh(X)--- 
f/(X) are primitive polynomials. Comparing the contents, 


P1P2*** Pr ~ Pi Po*** Pi 


and 


AQOAXY: fXO) ~ ffOLO--- OX. 


Since R isa U.RD,r = ft, and after some rearrangement p; is an associate of p} 
in R for all i. Hence, p; is an associate of p; in R[X] also. Next, since f,(X) and 
f/ (X) are irreducible elements of R[X] of positive degrees, by Lemma 11.5.15, they 
are irreducible in F[X], where F is the field of fractions of R. Since F[X] (being 
aPJI.D)isaU.FRD,s = u, and after some rearrangement f;(X) is an associate of 
f{(X) in F[X] for all k. Since fi, (X) and f/(X) are primitive, by the Lemma 11.5.14, 
fx(X) is an associate of f{(X) in R[X] for all k. This shows that R[X] is a unique 
factorization domain. tt 


Corollary 11.5.16 Jf R is a U.F-D, then R{X,, X2,..., Xn] is also a U.ED. 

Proof Follows by Gauss theorem and induction on n. ft 
Since Z is a U.E.D, we have the following. 

Corollary 11.5.17 Z[X] is a U.ED. tt 
Following result is the converse of the Gauss theorem. 


Theorem 11.5.18 Jf R[X] is a U.ED, then R is also a U.F-D. 
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Proof Suppose that R[X] is a U.ED. Let a be a nonzero nonunit element of R. 
Then, it is nonzero nonunit element of R[X] also. Hence, a is product of irreducible 
elements of R[X]. Comparing degrees, we obtain that all irreducible factors are 
elements of R also. Next, suppose that 


a = Pipr-** Pr = PPh-** Py 


where p; and p’, are irreducible elements of R, and so of R[X] also. Since R[X] is 
aU.F.D,r = s, and after some rearrangement p; and p; differ by units of R[X]. 
The result follows if we observe that the units of R[X] are those of R. tt 


Criteria for Irreducibility of Polynomials 


Let R; and Ro be commutative integral domains with identities. Let 7 be a nonzero 
ring homomorphism from Rj, to R2. Let 


f(X) = ap + aX + (i X* 4+ ++) + a,X" 
be a polynomial in R;[X]. We have a polynomial f’(X) € R2[X] given by 
S°(X%) = aay) + a(ay)X + +++ + aay) X". 


The map f(X) ~ f7(X) is clearly a ring homomorphism which is surjective if and 
only if o is surjective, and it is injective if and only if o is injective. 


Theorem 11.5.19 Let R, and Ry be commutative integral domains with identities, 
and 0 a homomorphism from R, to Ro. Let Fy be the field of fractions of Ro. Let 
F(X) © Ri[X]. Suppose that deg( f°(X)) = deg(f (X)) (in particular f (X) and 
f° (X) are nonzero polynomials), and f° (X) is irreducible in Fy[X]. Then f (X) 
can not be expressed as product of two nonconstant polynomials in R,[X]. 


Proof Suppose that f(X) = g(X)h(X), where g(X) and h(X) are polynomials in 
R,[X]. Then 


F°(X) = g°(X)h?(X). 


Since deg(f(X)) = deg(f°(X)), it follows that deg(g(X)) = deg(g’(X)) and 
deg(h(X)) = deg(h’(X)). Since f" (X) is irreducible in Fy[X], g’(X) or h?(X) 
is a unit of F[X]. But, then deg(g(X)) = deg(g’(X)) = Oordeg(h(X)) = 
deg(h°(X)) = 0. Hence, g(X) is constant or A(X) is constant. tt 


Corollary 11.5.20 Let R; be a U.F-D, and Rz a commutative integral domain with 
identity. Let 0 be a ring homomorphism from R, to Ro. Let f(X) be a primitive 
polynomial in R\[X] with deg(f(X)) = deg(f?(X)). If f° (X) is irreducible in 
F,[X], then f (X) is irreducible in R,[X], and also in F\[X], where F; is the field 
of fractions of Rj. 
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Proof Follows from the above theorem if we observe that a primitive polynomial in 
R,[X] is irreducible if and only if it can not be written as product of two nonconstant 
polynomials. Also a primitive polynomial in R,[X] is irreducible in R,[X] if and 
only if it is irreducible in F)[X]. tt 


Corollary 11.5.21 Let R bea U.ED. Let p be a prime element of R and © the prime 
ideal generated by p. Let 


S(X) = ao + aX + aX? foes) + a,X" 


be a polynomial in R[X] of positive degree such that p does not divide a,. Suppose 
that 


F(X) = G + HX + +) + GX", 
where G = a; + © € R/@. Suppose that f (X) is irreducible in R/g[X]. Then 
J (X) is irreducible in F|X], where F is the field of fractions of R. 


Proof Let f(X) = c(f(X)) fi(X), where f;(X) is a primitive polynomial of same 
degree as that of f(X). Also f(X) is irreducible in F[X] if and only if f,;(X) is 
irreducible in F[X]. If p does not divide the leading coefficient of f(X), then it 
does not divide the leading coefficient of f;(X) also. Next, if F(X ) is irreducible in 
R/g[X], then F(X) is irreducible in R/go[X]. Thus, we may assume that f(X) is 
a primitive polynomial. Suppose that f(X) is reducible in F[X]. Then, since f(X) 
is primitive in R[X], 


F(X) = g(Xh(X), 

where g(X) and h(X) are polynomials in R[X] of positive degrees. But, then 
F(X) = B(X)A(X). 

Since p does not divide a,, deg(f(X)) = deg(f (X)). Hence, deg(g(X) = 


deg(g(X)) and deg(h(X)) = deg(h(X)). Thus, g(X) and h(X) are of positive 
degrees. But, then f(x ) would be reducible in R/go[X]. tt 


Corollary 11.5.22 Let 
S(X) = ao + aX + aX? foes) + a, X" 


be a polynomial in Z[X] such that p does not divide a, for some prime p. Suppose 
that 


F(X) = G + GX + HX? + --- + GX" 


is irreducible in Z,[X). Then, f (X) is irreducible in Q(X]. 
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Proof Since Z, is a field, FO irreducible in Z,[X] implies that deg(f (X)) = 
deg(f(X)) > 0. The result follows from the above corollary. tt 


Remark 11,.5.23 The above corollary gives us a sufficient condition for a polynomial 
in Z[X] to be irreducible in Q[ X]. Thus, 


S(X) = ao + ayX + aX? foes) + a, X" 


in ZX] is irreducible in Q[X] if for some prime p of Z, p does not divide a, and 
f (X) is irreducible in Z,[X]. This criteria is quite useful in checking the irreducibility 
of f(X). 


Example 11.5.24 We shall see in a later chapter that the polynomial X? — X +a 
is irreducible in Z,,[X] whenever a # 0. In particular, X? — X + 1 is irreducible 
in Z,[X] for every prime p. It follows from the above corollary that Xx? — X + 1 
is irreducible in Q[X]. In fact X? — X +4 ais irreducible in Q[X] whenever p does 
not divide a. The polynomial X 5 10x? 6X 1 is also irreducible in Q[X], 
for in Zs[X], and the polynomial X° 10X 6X 1= X° -— X — lis 
irreducible. 


Theorem 11.5.25 (Eisenstein Irreducibility Criteria) Let R be a unique factorization 
domain, and 


f(X) =a + ax + dy X* + ee) + a, X" 
be a polynomial in R[X]. Suppose that there is a prime element p in R such that 
(i) p/a; foralli < n, 
(ii) p does not divide ay, and 
(iii) p- does not divide ao. 
Then, f (X) is irreducible in F(X], where F is the field of fractions of R. Further, if 
F(X) is primitive, then it is irreducible in R[X] also. 
Proof Consider the residue polynomial 


F(X) = & + GX + @X? +--+) + GX" = aX" 


(for p/a; for alli < n) of f(X) in R/g[X], where g is the prime ideal generated 
by p. Note that deg(f(X)) > 1. Suppose that f(X) is reducible in F[X]. Then, 
f (X) is product of two polynomials in R[X] of positive degrees. Suppose that 


F(X) = g(X)h(X), 
where deg(g(X)) > Oand deg(h(X)) > 0. Then 


FQ) = GX” = 2K): 
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Clearly, deg(g(X)) 
Hence 


deg(g(X)) > Oand deg(h(X)) = deg(h(X)) > 0. 


2(X) = b,X’andh(X) = GX‘, 


where r,s > 1 and b, +0 4<,. But, then g(X) and h(X) are polynomials in R[X] 
whose constant terms are divisible by p. Since f(X) = g(X)h(X), this would 
imply that the constant term ap of f(X) is divisible by p*. This is a contradiction to 
the hypothesis. Thus, f(X) is irreducible in F[X]. In turn, if in addition to it, f(X) 
is primitive also, then it is irreducible in R[X] also. tt 


Corollary 11.5.26 Let p be a positive prime in Z. Then the Cyclotomic polynomial 
fp(X) =1+X + x? ee = xP 


is irreducible in Q[X]. 


Proof Suppose that f,(X) is reducible in Q[X]. Then f,,(X + 1) is also reducible. 
Now, 


(X+1)?-1 


= p-l p24, = 
fo(X +1) = (K+ VPC + (KF YPC +--+ (X44 YN 41 C11 


Using the binomial theorem, we see that 
fro(X+1) = xeta PC, xP? tee $ PCy. 


Since ?C;, 1 < i < p—Lis divisible by p and’C,_; = p isnot divisible by p*, by 
the Eisenstein irreducibility criteria f,,(X + 1) should be irreducible, a contradiction 
to the supposition. Hence, f,,(X) is irreducible. tt 


Corollary 11.5.27 Let F be a field and f (X, Y) an element of F(X, Y], where 
f (X,Y) = ao(X) + ay(X)¥ + ao(X)¥? + + + ag (X)Y". 


Suppose that X/a;(X) for alli <n —1, X does not divide ay,(X), and X* does not 
divide aj(X). Then, f (X, Y) is irreducible in F(X)[Y], where F(X) is the field of 
fractions of F(X]. Further, if g.c.d of coefficients of powers of Y in f (X, Y) is a unit 
in F[X], then f (X, Y) is irreducible in F[X, Y}. 


Proof Follows from the Eisenstein irreducibility criteria if we note that F[X] is a 
U.ED, and X is a prime element of F[X]. t 


Example 11.5.28 The polynomial X° + 6X? + 3X + 6is irreducible in Q[X] and 
also in Z[X], for 3 is a prime element of Z which satisfies the Eisenstein irreducibility 
criteria. The above polynomial is also primitive. 
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Exercises 


11.5.1 Show that Z[./—5] is not a U-ED but every nonzero nonunit element is 
expressible as product of irreducible elements. Also any two factorization contains 
the same number of irreducible factors. 


11.5.2 Show that 3 + /—7 is irreducible in Z[,/—7]. Is it prime? Is Z[./—7] a 
U.ED? 


11.5.3 Give an example of a commutative integral domain with identity in which 
not every nonzero nonunit element is expressible as product of irreducible elements. 


11.5.4 Show by means of an example that homomorphic image of a U.E.D need not 
be a U.ED even if it is an integral domain. 


11.5.5 Show that the ring RIX] of formal power series over the field R of real 
numbers is a U.E.D. 


11.5.6 Prove that the following polynomials are irreducible in Q[X]. 
(i) X° — 3X? + 6X + 15. 
(ii) X? + 6X? + 7. 
ily Xe BOP a TIN a 1 Se 10, 


11.5.7 Show that the polynomial 
G47 4.2% 40-74 xX 


is irreducible in Z[i][X]. 


11.5.8 Show that 1+ X¥ +2(1+ X)¥+3(1+ X)?¥*+Y? is irreducible in C[X, Y], 
where C is the field of complex numbers. 


11.5.9 Factor X* + X? + 1 in to irreducible factors of Q[X]. 
11.5.10 Show that Z[i] and Z[./—5] are not isomorphic. 
11.5.11 Show that Z[X] and Z[i] are not isomorphic. 


11.5.12 Show that X* — 10X? — 1 is irreducible in Q[X]. Is it irreducible modulo 
2,3 or 5? 


11.5.13 Are Z[i] and Z[w] isomorphic? 
11.5.14 Show that if p = a? +b? is irreducible in Z, then a + bi is prime in Z[i]. 


11.5.15 Suppose that f(X) ¢ Z[X] is reducible. Show that there exists mo € N such 
that for every prime p > no, the polynomial f(X) is reducible in Z,[X]. 


11.5.16 Let p be a prime integer in Z. Suppose that it is not prime in Z[i]. Show 
that there is a prime element z = a+ bi in Z[i] such that p = zz. 


11.5.17 Let p be a prime integer in Z. Show that p is prime in Zi] if and only if 
p =3(mod 4). 


Appendix 


Category Theory 


In this book, we introduced and studied several mathematical structures,viz. semi- 
groups, groups, rings, and fields. We noticed some common features in the study of 
these structures. The category theory gives a unified general and abstract setting for 
all these and many more mathematical structures such as modules, vector spaces, and 
topological spaces. Quite often, in mathematics, the concrete results are expressed in 
the language of category theory. This appendix introduces the very basics in category 
theory for the purpose. The Gédel—Bernays axiomatic system for sets is the most 
suitable axiomatic system for category theory. As described in Chap. 2, class is a 
primitive term in this axiomatic system instead of sets. Indeed, sets are simply the 
members of classes. The classes which are not sets are termed as proper classes. 


Definition A category & consists of the following: 

1. Aclass Obj= called the class of objects of X. 

2. For each pair A, B in ObjX, we have a set Mory (A, B) called the set of mor- 
phisms from the object A to the object B. Further, 


Mors (A, B) (\ Mors (A’, B’) 4 if and only ifA = A’ andB = B’. 


3. For each triple A,B,C in ObjX, we have a map - from Mors(B,C) x 
Mory (A, B) to Mory(A, C) called the law of composition. We denote the image 
-(g,f) by gf. Further, the law of composition is associative in the sense that if 
Ff € Mors(A, B), g € Mory(B, C) andh € Mors(C, D), then (hg)f = h(gf). 

4. For each A € Obj&, there is an element J, in Mory(A, A) such that fl4 = f 
for all morphisms from A, and gl, = g for all morphisms to A. 

Clearly, for each object A of &, J, is unique morphism, and it is called the identity 
morphism on A. The category © is called a small category if Obj is a set. 


Examples 1. We have the category SET of sets whose objects are sets, and the 
morphisms from a set A to a set B are precisely the maps from A to B. 

2. There is a category GP of groups whose objects are groups, and the morphisms 
from a group H to a group K are homomorphisms from H to K. 


© Springer Nature Singapore Pte Ltd. 2017 421 
R. Lal, Algebra 1, Infosys Science Foundation Series in Mathematical Sciences, 
DOI 10.1007/978-98 1-10-4253-9 


422 Appendix 


3. SG denotes the category of semigroups all of whose objects are semigroups, 
and morphisms are semigroup homomorphisms. 

4. AB denotes the category of abelian groups whose objects are abelian groups, 
and morphisms are group homomorphisms. 

5. RING denotes the category of rings whose objects are rings, and morphisms 
are ring homomorphisms. 

6. TOP denotes the category of topological spaces whose objects are topological 
spaces, and the morphisms are continuous maps. 

7. A group G can also be treated as a category having a single object G. The 
elements of the group can be taken as morphisms from G to G, and the composition 
of the morphisms is the binary operation of G. 

Let & be a category. A morphism f from A to B is said to be a monomorphism 
(epimorphism) if it can be left (right) canceled in the sense that fg = fh(gf = hf) 
implies that g = h. A morphism f from A to B is said to be an isomorphism if 
there is a morphism g from B to A such that gf = I, andfg = Ig. Clearly, sucha 
morphism g is unique, and it is called the inverse of f. The inverse of f, if exists, is 
denoted by f~!. 


Remark In the category SET of sets, a morphism f from a set A to a set B is a 
monomorphism (epimorphism) if and only if it is an injective (surjective) map. Also 
in the category GP, a morphism f from a group H to a group K is a monomorphism 
if and only iff is injective (prove it). Clearly, an isomorphism is a monomorphism as 
well as an epimorphism. However, a morphism which is a monomorphism as well as 
epimorphism need not be an isomorphism. For example, consider the category Haus 
of Hausdorff topological spaces. The inclusion map from Q to R is a monomorphism 
as well as an epimorphism (it is an epimorphism because Q is dense in R) but it is 
not an isomorphism. 

Let & be a category and A be an object of &. Then, Mors (A, A) is a monoid with 
respect to the composition of morphisms. The members of Mory (A, A) are called 
the endomorphisms of A. The monoid Mors (A, A) is denoted by End(A). An iso- 
morphism from A to A is called an automorphism of A. The set of all automorphisms 
of A is denoted by Aut(A) which is a group under the composition of morphisms. 

Let & be a category. We say that a category I" is a subcategory of & if (i) 
ObjT C &, (Gi) for each pair A, B € ObjT, Morp(A, B) C Mory(A, B), and (iii) 
the law of composition of morphisms in I is the restriction of the law of composi- 
tion of morphisms in & to I’. The subcategory I is said to be a full subcategory if 
Mor, (A, B) = Morys(A, B) for all A, B € ObjT’. AB is a full subcategory of GP. 
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Functors 


Definition Let & and IT be categories. A functor F from & to I is an association 
which associates with each member A € ObjX, a member F(A) of I’, and to each 
morphism f € Morys(A, B), a morphism F(f) € Morp(F(A), F(B) such that the 
following two conditions hold: 


(i) F(gf) = F(g)F(f) whenever the composition gf is defined. 
Gi) FU4) = Tra) for allA € &. 


Let & be a category. Consider the category &° whose objects are same as the 
objects of 2, Mors (A, B) = Mory.(B, A), and the composition fg in X° is same 
as gf in X&. The category X° is called the opposite category of X. A functor from X° 
to the category I is called a contravariant functor from & tol’. 

If & is acategory, then the identity map Jgp;5 from Obj X to itself defines a functor 
called the identity functor. Composition of any two functors is again a functor. 


Examples 1. Let H be a group. Denote its abelianizer H/[H, K] by Ab(#). Let f be 
a homomorphism from H to a group K. Then f induces ahomomorphism Ab(f) from 
Ab(Hf) to Ab(K) defined by Ab(f)(A[H, H]) = f(h)[K, K]. This defines a functor 
Ab from the category GP of groups to the category AB. This functor is called the 
abelianizer functor. 

2. Let H be a group. Denote the commutator [H, H] of H by Comm(#). If f be a 
homomorphism from H to K, it induces ahomomorphism Comm(f) from Comm(H) 
to Comm(K) defined by Comm(f)({a, b]) = [f(a),f(b)]. This defines a functor 
Comm from the category GP to itself. This functor is called the commutator functor. 

3. We have a functor 2 from the category GP of groups to the category SET of 
sets which simply forgets the group structure and retains the set part of the group. 
More explicitly, Q(G, 0) = G. Such a functor is called a forgetful functor. There is 
another such functor from the category RING of rings to the category AB of abelian 
groups which forgets the ring structure, but retains the additive group part of the ring. 
There is still another forgetful functor from the category TOP to the category SET 
which forgets the topological structure, and retains the set part of the space. 

4. Let f be a map from a set X toa set Y. Then, f induces a unique homomorphism 
F(f) from the free group F(X) to the free group F(Y). This gives us the functor F 
from the category SET to the category GP. This functor is called the free group 
functor. 

5. Let & be a category and A be an object of &. For each B € ObjX, we put 
Mory (A, —)(B) = Mory(A, B), and for each morphism/ from B to C, we have amap 
Mory (A, —)(f) from Mory (A, B) to Mory(A, C) defined by Mors (A, —)(f)(g) = 
fg. It is easily verified that Mory(A, —) defined above is a functor from & to the 
category SET of sets. Similarly, we have a contravariant functor Mory(—, A) from 
the category & to the category SET of sets. 
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Remark There is a useful important functor, viz, homotopy group functor 7; from 
the category TOP* of pointed topological spaces to the category GP of groups. It has 
tremendous application in geometry and topology. This functor will be discussed in 
Algebra 3. 

Let F be a functor from a category & to a category I’. The functor F is said to be 
faithful if for each pair A, B € Obj&, the induced map f +> F(f) from Mors (A, B) 
to Morr (F(A), F(B)) is injective. The functor F is said to be a full functor if these 
induced maps are surjective. The forgetful functor from GP to SET is faithful but 
it is not full. The abelianizer functor Ab is not faithful. A functor F is said to be an 
isomorphism from the category & to the category I if there is a functor G from T° 
to & such that GoF = Jy and FoG = [,. 


Natural Transformations 


Definition Let F and G be functors from a category & to a category I’. A natural 
transformation 7 from F to G is a family {74 € Morr(F(A), G(A)) | A € Objd} of 
morphisms in I’ such that the diagram 


F(A) 4S GA) 
F(f) G(f) 
7B 


F(B) —__________,G(B) 


is commutative for all morphisms f in D. 


Examples 1. Let vg denote the quotient homomorphism from G to G/[G, G]. Then, 
the family {vg | G ¢ ObjGP} defines a natural transformation v from the identity 
functor [gp to the abelianizer functor Ab. 

2. For each set X, we have the inclusion map ix from X to Q(F(X)) where 
F is the free group functor, and Q is the forgetful functor. Evidently, the family 
{ix | X € ObjSET} of maps defines a natural transformation from the identity 
functor Isg7 to the functor QoF on the category SET. 

Let F and G be two functors from a category & to a category I. A natural 
transformation 77 from F to G is called a natural equivalence if 74 is an isomorphism 
from F(A) to G(A) for all A € Obj&. This is equivalent to say that there is a natural 
transformation p from G to F such that paony = Iria) and nyop, = Ig a) for all 
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objects A of &. A functor F from & to T is called an equivalence from & to T’ if 
there is a functor G from I’ to & such that FoG and GoF are naturally equivalent 
to the corresponding identity functors. Notice that there is a difference between 
isomorphism and equivalence between categories. An equivalence need not be an 
isomorphism. 

Let & and I be categories. Then, & x I represents the category whose objects 
are pairs (A, B) € ObjX x ObjY, and a morphism from (A, B) to (C, D) is a pair 
(f, g), where f is a morphism from A to C in &, and g is a morphism from B to D 
in I’. The composition law is coordinate-wise. This category is called the product 
category. 

Let F bea functor froma category © to acategory I’. Letf be amorphism from C to 
Ain &, and g bea morphism from B to Din’. This defines amap Morr (F (f), g) from 
the set Mor, (F(A), B) to Morr (F(C), D) given by Morr (F(f), g)(h) = ghF(f). 
This defines a functor Morr (F(—), —) from the product category X° x IT’ to the 
category SET of sets. Similarly, given a functor G from I’ to ©, we have a functor 
Mors(—, G(—)) to the category SET of sets. We say that F is left adjoint to G, 
or G is right adjoint to F if there is a natural isomorphism 7 from the functor 
Morr (F(—), —) to the functor Mors (—, G(—)). More explicitly, for each object A 
in & and each object B in I’, we have a bijective map 74,2 from Morr (F(A), B) to 
the set Mors (A, G(B)) such that Mors (f, G(g))ona.3 = Nc.poMorr(F(f), g) for 
all morphisms (f, g) in &° x T° (look at the corresponding commutative diagram). 


Examples 1. Consider the category SET of sets and the category GP of groups. We 
have the free group functor F from the category SET to the category GP. More 
explicitly, for each set X, we have the free group F(X) on the set X. We also have 
the forgetful functor Q from GP to SET. From the universal property of free group, 
every group homomorphism f from F(X) to G determines and is uniquely determined 
by its restriction to X. This gives us a bijective map nx,g from Hom(F(X), G) to 
Map(X, &(G)). It is easy to observe (using the universal property of a free group) 
that 7, thus obtained, is a natural equivalence. Hence, the free group functor F is left 
adjoint to the forgetful functor Q. 

2. We have the forgetful functor Q from the category AB of abelian groups to the 
category GP of groups. We also have the abelianizer functor Ab from GP to AB. It 
can be easily verified that Ab is left adjoint to Q. 


Products and Coproducts in a Category 


Definition Let A and B be objects in a category &. A product of A and B in & is a 
triple (P, f, g), where P is an object of the category &, f is morphisms from P to A, 
and g is a morphism from P to B such that given any such triple (P’, f’, g’), there is 
a unique morphism ¢ from P’ to P such that f¢@ = f’ andg¢d = g’. 

It is easily observed from the definition that if (P,f, g) and (P’, f’, g’) are two 
products of A and B, then there is an isomorphism ¢ from P’ to P with f@ = f’ and 


426 Appendix 


gd = g'. Thus, if the product exists, then it is unique up to natural isomorphism. 
The product of A and B is usually denoted by A x B. 

In the category SET of sets, the Cartesian product A x B with the corresponding 
projection maps is the product in the category SET. Similarly, the direct product 
H x K of the groups H and K together with the corresponding projection maps is 
the product of H and K in the category GP. 

Dually, we have the following: 


Definition Let A and B be objects in a category &. A coproduct of A and B in & is 
a triple (U, f, g), where U is an object of the category &, f is morphisms from A to 
U, and g is a morphism from B to U such that given any such triple (U’, f’, g’), there 
is a unique morphism ¢ from U to U’ such that ¢f = f’ and gg = g’. 

It is easily observed from the definition that if (U,f, g) and (U’, f’, g’) are two 
coproducts of A and B, then there is anisomorphism ¢ from U to U' with ¢f = f’ and 
og = g'. Thus, if the co-product exists, then it is unique up to natural isomorphism. 
The coproduct of A and B is denoted by A [| B. 

In the category SET of sets, the disjoint union (A x {0}) U (B x {1}) of A and 
B with the natural inclusion maps is the coproduct of A and B in the category SET. 
Similarly, the free product H x K of the groups H and K together with the natural 
inclusion maps is the coproduct of H and K in the category GP. 


PullBack and Push-out Diagrams 


Definition Let © be a category. Let f € Mory(A, C), and g € Mory(B,C). A 
commutative diagram 


g 


is said to be a pullback diagram if given any commutative diagram 
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g 


there exists a unique morphism ¢ from D to P such thath@ = pandké = v. 

Dually, a push-out diagram can be defined by reversing the arrows in the definition. 
The reader is advised to formulate the definition of push-out diagram. 

In general, pullback and push out need not exist in a category. However, they exist 
in the category SET of sets and also in the category GP of groups: Let f be a morphism 
from A to C, and g be a morphism from B to C in the category SET / GP. Consider the 
product A x B in the category SET/GP. Let P = {(a,b) €AxB|f(a) = g(b)}. 
Let h denote first projection from P to A, and k denotes the second projection from 
P to B. This gives us a pullback diagram in SET /GP. Similarly, push out also exists 
in the category SET, and also it exists in the category GP. 
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